Hydrazinyl-Substituted Heteroaryl Compounds and Methods for Producing a Conjugate

ABSTRACT

The present disclosure provides conjugate structures and hydrazinyl-substituted heteroaryl compounds used to produce these conjugates. The disclosure also encompasses methods of production of such conjugates and compounds, as well as methods of using the same.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No. 62/310,497, filed Mar. 18, 2016, the disclosure of which is incorporated herein by reference.

INTRODUCTION

The field of protein-small molecule therapeutic conjugates has advanced greatly, providing a number of clinically beneficial drugs with the promise of providing more in the years to come. Protein-conjugate therapeutics can provide several advantages, due to, for example, specificity, multiplicity of functions and relatively low off-target activity, resulting in fewer side effects. Chemical modification of proteins may extend these advantages by rendering them more potent, stable, or multimodal.

A number of standard chemical transformations are commonly used to create and manipulate post-translational modifications on proteins. There are a number of methods where one is able to modify the side chains of certain amino acids selectively. For example, carboxylic acid side chains (aspartate and glutamate) may be targeted by initial activation with a water-soluble carbodiimide reagent and subsequent reaction with an amine Similarly, lysine can be targeted through the use of activated esters or isothiocyanates, and cysteine thiols can be targeted with maleimides and α-halo-carbonyls.

One significant obstacle to the creation of a chemically altered protein therapeutic or reagent is the production of the protein in a biologically active, homogenous form. Conjugation of a drug or detectable label to a polypeptide can be difficult to control, resulting in a heterogeneous mixture of conjugates that differ in the number of drug molecules attached and in the position of chemical conjugation. In some instances, it may be desirable to control the site of conjugation and/or the drug or detectable label conjugated to the polypeptide using the tools of synthetic organic chemistry to direct the precise and selective formation of chemical bonds on a polypeptide.

SUMMARY

The present disclosure provides conjugate structures and hydrazinyl-substituted heteroaryl compounds used to produce these conjugates. The disclosure also encompasses methods of production of such conjugates and compounds, as well as methods of using the same.

Aspects of the present disclosure include a conjugate comprising at least one modified amino acid residue of formula (I):

wherein:

R¹ and R² are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

one of R³ and R⁴ is a polypeptide and the other is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl;

Z¹ is selected from CR⁷, N, O and S;

Z² is C or N;

R⁷ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

L is a linker; and

W is a drug or a detectable label.

In some embodiments, Z¹ and Z² are each N.

In some embodiments, Z¹ is CR⁷ and Z² is N.

In some embodiments, Z¹ is O and Z² is C.

In some embodiments, R³ is the polypeptide.

In some embodiments, R⁴ is the polypeptide.

In some embodiments, R⁵ and R⁶ are each independently selected from alkyl and substituted alkyl.

In some embodiments, R⁷ is hydrogen.

In some embodiments, the linker is of the formula -(T¹-V¹)_(a)-(T²-V²)_(b)-(T³-V³)_(c)-(T⁴-V⁴)_(d)-(T⁵-V⁵)_(e)-, wherein:

a, b, c, d and e are each independently 0 or 1, where the sum of a, b, c, d and e is 1 to 5;

T¹, T², T³, T⁴ and T⁵ are each independently selected from (C₁-C₁₂)alkyl, substituted (C₁-C₁₂)alkyl, (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4-amino-piperidinyl (4AP), para-aminobenzyl (PAB), para-amino-benzyloxy (PABO), meta-amino-benzyloxy (MABO), para-amino-benzyloxycarbonyl (PABC), meta-amino-benzyloxycarbonyl (MABC), an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, (AA)_(p)-MABO, (AA)_(p)-MABC, (AA)_(p)-PABO, (AA)_(p)-PABC, MABO-(AA)_(p), MABC-(AA)_(p), PABO-(AA)_(p), PABC-(AA)_(p), (AA)_(p)-MABO-(AA)_(p), (AA)_(p)-MABC-(AA)_(p), (AA)_(p)-PABO-(AA)_(p), and (AA)_(p)-PABC-(AA)_(p);

V¹, V², V³, V⁴ and V⁵ are each independently selected from the group consisting of a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—;

each R¹¹ and R¹³ is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

w is an integer from 1 to 20;

n is an integer from 1 to 30;

p is an integer from 1 to 20; and

h is an integer from 1 to 12.

In some embodiments:

EDA is an ethylene diamine having the structure:

wherein q is an integer from 1 to 6, r is 0 or 1, and each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or wherein two adjacent R¹² groups are cyclically linked to form a piperazinyl ring;

PEG is a polyethylene glycol or a substituted polyethylene glycol;

AA is an amino acid residue; and

4AP is

wherein each R¹⁴ is independently selected from hydrogen, alkyl, substituted alkyl, polyethylene glycol, substituted polyethylene glycol, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In some embodiments, each R¹¹ is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In some embodiments, each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or wherein two adjacent R¹² groups are cyclically linked to form a piperazinyl ring.

In some embodiments, each R¹³ is independently selected from hydrogen, alkyl, substituted alkyl, aryl, and substituted aryl.

In some embodiments, each R¹⁴ is independently selected from hydrogen, alkyl, substituted alkyl, polyethylene glycol, substituted polyethylene glycol, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In some embodiments, T¹, T², T³,T⁴ and T⁵ and V¹, V², V³, V⁴ and V⁵ are selected from the following table:

T¹ V¹ T² V² T³ V³ T⁴ V⁴ T⁵ V⁵ (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —NR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (EDA)_(w) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —SO₂— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (CR¹³OH)_(h) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— — — (C₁-C₁₂)alkyl —CONR¹¹— substituted —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl (C₁-C₁₂)alkyl —SO2— (C₁-C₁₂)alkyl —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl — (CR¹³OH)_(h) —CONR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —P(O)OH— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— (C₁-C₁₂)alkyl —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- —NR¹¹— (C₁-C₁₂)alkyl —CO— PABC (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- — — — PABC- (AA)_(p) (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC- — (AA)_(p) (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABO — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABO — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC —NR¹¹— — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC- — (AA)_(p) (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — —.

In some embodiments, the linker is selected from one of the following structures:

wherein:

each f is independently 0 or an integer from 1 to 12;

each n is independently 0 or an integer from 1 to 30;

each y is independently 0 or an integer from 1 to 20;

each h is independently 0 or an integer from 1 to 12;

each p is independently 0 or an integer from 1 to 20;

each R is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; and

each R′ is independently selected from hydrogen, a sidechain group of an amino acid, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

Aspects of the present disclosure include a compound of formula (II):

wherein:

R⁸ is H and R⁹ is —(CR²R⁴)(NR⁶)(NHR⁵), or R⁸ is —(CR¹R³)(NR⁵)(NHR⁶) and R⁹ is H;

R¹, R², R³ and R⁴ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl;

Z¹ is selected from CR⁷, N, O and S;

Z² is C or N;

R⁷ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

L is a linker; and

W is a drug or a detectable label.

In some embodiments, Z¹ and Z² are each N.

In some embodiments, Z¹ is CR⁷ and Z² is N.

In some embodiments, Z¹ is O and Z² is C.

In some embodiments, R⁵ and R⁶ are each independently selected from alkyl and substituted alkyl.

In some embodiments, R⁷ is hydrogen.

In some embodiments, the linker is of the formula -(T¹-V¹)_(a)-(T²-V²)_(b)-(T³-V³)_(c)-(T⁴-V⁴)_(d)-(T⁵-V⁵)_(e)-, wherein:

a, b, c, d and e are each independently 0 or 1, where the sum of a, b, c, d and e is 1 to 5;

T¹, T², T³, T⁴ and T⁵ are each independently selected from (C₁-C₁₂)alkyl, substituted (C₁-C₁₂)alkyl, (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4-amino-piperidinyl (4AP), para-aminobenzyl (PAB), para-amino-benzyloxy (PABO), meta-amino-benzyloxy (MABO), para-amino-benzyloxycarbonyl (PABC), meta-amino-benzyloxycarbonyl (MABC), an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, (AA)_(p)-MABO, (AA)_(p)-MABC, (AA)_(p)-PABO, (AA)_(p)-PABC, MABO-(AA)_(p), MABC-(AA)_(p), PABO-(AA)_(p), PABC-(AA)_(p), (AA)_(p)-MABO-(AA)_(p), (AA)_(p)-MABC-(AA)_(p), (AA)_(p)-PABO-(AA)_(p), and (AA)_(p)-PABC-(AA)_(p);

V¹, V², V³, V⁴ and V⁵ are each independently selected from the group consisting of a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—;

each R¹¹ and R¹³ are independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

w is an integer from 1 to 20;

n is an integer from 1 to 30;

p is an integer from 1 to 20; and

h is an integer from 1 to 12.

In some embodiments:

EDA is an ethylene diamine having the structure:

wherein q is an integer from 1 to 6, r is 0 or 1, and each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or wherein two adjacent R¹² groups are cyclically linked to form a piperazinyl ring;

PEG is a polyethylene glycol or a substituted polyethylene glycol;

AA is an amino acid residue; and

4AP is

wherein each R¹⁴ independently selected from hydrogen, alkyl, substituted alkyl, polyethylene glycol, substituted polyethylene glycol, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In some embodiments, each R11 is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In some embodiments, each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or wherein any two adjacent R¹² groups are cyclically linked to form a piperazinyl ring.

In some embodiments, each R¹³ is independently selected from hydrogen, alkyl, substituted alkyl, aryl, and substituted aryl.

In some embodiments, each R¹⁴ is independently selected from hydrogen, alkyl, substituted alkyl, polyethylene glycol, substituted polyethylene glycol, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In some embodiments, T¹, T², T³, T⁴ and T⁵ and V¹, V², V³, V⁴ and V⁵ are selected from the following table:

T¹ V¹ T² V² T³ V³ T⁴ V⁴ T⁵ V⁵ (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —NR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (EDA)_(w) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —SO₂— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (CR¹³OH)_(h) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— — — (C₁-C₁₂)alkyl —CONR¹¹— substituted —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl (C₁-C₁₂)alkyl —SO2— (C₁-C₁₂)alkyl —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl — (CR¹³OH)_(h) —CONR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —P(O)OH— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— (C₁-C₁₂)alkyl —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- —NR¹¹— (C₁-C₁₂)alkyl —CO— PABC (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- — — — PABC- (AA)_(p) (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC- — (AA)_(p) (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABO — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABO — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC —NR¹¹— — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC- — (AA)_(p) (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — —.

In some embodiments, the linker is selected from one of the following structures:

wherein:

each f is independently 0 or an integer from 1 to 12;

each n is independently 0 or an integer from 1 to 30;

each y is independently 0 or an integer from 1 to 20;

each h is independently 0 or an integer from 1 to 12;

each p is independently 0 or an integer from 1 to 20;

each R is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; and

each R′ is independently selected from hydrogen, a sidechain group of an amino acid, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

Aspects of the present disclosure include a method of producing a polypeptide conjugate, where the method includes combining in a reaction mixture: a compound of the present disclosure, and a polypeptide that includes a reactive aldehyde group or a reactive ketone group. The combining is under reaction conditions suitable to promote reaction between the compound and the reactive aldehyde group or reactive ketone group of the polypeptide to form a polypeptide conjugate. The method further includes isolating the polypeptide conjugate from the reaction mixture.

Aspects of the present disclosure include a pharmaceutical composition, which includes a conjugate of the present disclosure, and a pharmaceutically acceptable excipient.

Aspects of the present disclosure include a method of delivering a conjugate to a subject. The method includes administering to the subject an effective amount of a conjugate of the present disclosure.

Aspects of the present disclosure include a method of treating a condition in a subject. The method includes administering to the subject having the condition a therapeutically effective amount of a pharmaceutical composition comprising a conjugate of the present disclosure, where the administering is effective to treat the condition in the subject.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a hydrophobic interaction column (HIC) trace of an antibody-drug conjugate produced by conjugating an aldehyde-tagged antibody to a maytansine modified to include a hydrazinyl-substituted imidazolyl coupling moiety, according to embodiments of the present disclosure.

DEFINITIONS

The following terms have the following meanings unless otherwise indicated. Any undefined terms have their art recognized meanings.

“Alkyl” refers to monovalent saturated aliphatic hydrocarbyl groups having from 1 to 10 carbon atoms and such as 1 to 6 carbon atoms, or 1 to 5, or 1 to 4, or 1 to 3 carbon atoms. This term includes, by way of example, linear and branched hydrocarbyl groups such as methyl (CH₃—), ethyl (CH₃CH₂—), n-propyl (CH₃CH₂CH₂—), isopropyl ((CH₃)₂CH—), n-butyl (CH₃CH₂CH₂CH₂—), isobutyl ((CH₃)₂CHCH₂—), sec-butyl ((CH₃)(CH₃CH₂)CH—), t-butyl ((CH₃)₃C—), n-pentyl (CH₃CH₂CH₂CH₂CH₂—), and neopentyl ((CH₃)₃CCH₂—).

The term “substituted alkyl” refers to an alkyl group as defined herein wherein one or more carbon atoms in the alkyl chain (except the C₁ carbon) have been optionally replaced with a heteroatom such as —O—, —N—, —S—, —S(O)_(n)— (where n is 0 to 2), —NR— (where R is hydrogen or alkyl) and having from 1 to 5 substituents selected from the group consisting of alkoxy, substituted alkoxy, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, acyl, acylamino, acyloxy, amino, aminoacyl, aminoacyloxy, oxyaminoacyl, azido, cyano, halogen, hydroxyl, oxo, thioketo, carboxyl, carboxylalkyl, thioaryloxy, thioheteroaryloxy, thioheterocyclooxy, thiol, thioalkoxy, substituted thioalkoxy, aryl, aryloxy, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, hydroxyamino, alkoxyamino, nitro, —SO-alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-aryl, —SO₂-heteroaryl, and —NR^(a)R^(b), wherein R′ and R″ may be the same or different and are chosen from hydrogen, optionally substituted alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, aryl, heteroaryl and heterocyclic.

“Alkylene” refers to divalent aliphatic hydrocarbyl groups preferably having from 1 to 6 and more preferably 1 to 3 carbon atoms that are either straight-chained or branched, and which are optionally interrupted with one or more groups selected from —O—, —NR¹⁰—, —NR¹⁰C(O)—, —C(O)NR¹⁰— and the like. This term includes, by way of example, methylene (—CH₂—), ethylene (—CH₂CH₂—), n-propylene (—CH₂CH₂CH₂—), iso-propylene (—CH₂CH(CH₃)—), (—C(CH₃)₂CH₂CH₂—), (—C(CH₃)₂CH₂C(O)—), (—C(CH₃)₂CH₂C(O)NH—), (—CH(CH₃)CH₂—), and the like.

“Substituted alkylene” refers to an alkylene group having from 1 to 3 hydrogens replaced with substituents as described for carbons in the definition of “substituted” below.

The term “alkane” refers to alkyl group and alkylene group, as defined herein.

The term “alkylaminoalkyl”, “alkylaminoalkenyl” and “alkylaminoalkynyl” refers to the groups R′NHR″— where R′ is alkyl group as defined herein and R″ is alkylene, alkenylene or alkynylene group as defined herein.

The term “alkaryl” or “aralkyl” refers to the groups -alkylene-aryl and -substituted alkylene-aryl where alkylene, substituted alkylene and aryl are defined herein.

“Alkoxy” refers to the group —O-alkyl, wherein alkyl is as defined herein. Alkoxy includes, by way of example, methoxy, ethoxy, n-propoxy, isopropoxy, n-butoxy, t-butoxy, sec-butoxy, n-pentoxy, and the like. The term “alkoxy” also refers to the groups alkenyl-O—, cycloalkyl-O—, cycloalkenyl-O—, and alkynyl-O—, where alkenyl, cycloalkyl, cycloalkenyl, and alkynyl are as defined herein.

The term “substituted alkoxy” refers to the groups substituted alkyl-O—, substituted alkenyl-O—, substituted cycloalkyl-O—, substituted cycloalkenyl-O—, and substituted alkynyl-O— where substituted alkyl, substituted alkenyl, substituted cycloalkyl, substituted cycloalkenyl and substituted alkynyl are as defined herein.

The term “alkoxyamino” refers to the group —NH-alkoxy, wherein alkoxy is defined herein.

The term “haloalkoxy” refers to the groups alkyl-O— wherein one or more hydrogen atoms on the alkyl group have been substituted with a halo group and include, by way of examples, groups such as trifluoromethoxy, and the like.

The term “haloalkyl” refers to a substituted alkyl group as described above, wherein one or more hydrogen atoms on the alkyl group have been substituted with a halo group. Examples of such groups include, without limitation, fluoroalkyl groups, such as trifluoromethyl, difluoromethyl, trifluoroethyl and the like.

The term “alkylalkoxy” refers to the groups -alkylene-O-alkyl, alkylene-O-substituted alkyl, substituted alkylene-O-alkyl, and substituted alkylene-O-substituted alkyl wherein alkyl, substituted alkyl, alkylene and substituted alkylene are as defined herein.

The term “alkylthioalkoxy” refers to the group -alkylene-S-alkyl, alkylene-S-substituted alkyl, substituted alkylene-S-alkyl and substituted alkylene-S-substituted alkyl wherein alkyl, substituted alkyl, alkylene and substituted alkylene are as defined herein.

“Alkenyl” refers to straight chain or branched hydrocarbyl groups having from 2 to 6 carbon atoms and preferably 2 to 4 carbon atoms and having at least 1 and preferably from 1 to 2 sites of double bond unsaturation. This term includes, by way of example, bi-vinyl, allyl, and but-3-en-1-yl. Included within this term are the cis and trans isomers or mixtures of these isomers.

The term “substituted alkenyl” refers to an alkenyl group as defined herein having from 1 to 5 substituents, or from 1 to 3 substituents, selected from alkoxy, substituted alkoxy, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, acyl, acylamino, acyloxy, amino, substituted amino, aminoacyl, aminoacyloxy, oxyaminoacyl, azido, cyano, halogen, hydroxyl, oxo, thioketo, carboxyl, carboxylalkyl, thioaryloxy, thioheteroaryloxy, thioheterocyclooxy, thiol, thioalkoxy, substituted thioalkoxy, aryl, aryloxy, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, hydroxyamino, alkoxyamino, nitro, —SO-alkyl, —SO-substituted alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-substituted alkyl, —SO₂-aryl and —SO₂-heteroaryl.

“Alkynyl” refers to straight or branched monovalent hydrocarbyl groups having from 2 to 6 carbon atoms and preferably 2 to 3 carbon atoms and having at least 1 and preferably from 1 to 2 sites of triple bond unsaturation. Examples of such alkynyl groups include acetylenyl (—C≡CH), and propargyl (—CH₂C≡CH).

The term “substituted alkynyl” refers to an alkynyl group as defined herein having from 1 to 5 substituents, or from 1 to 3 substituents, selected from alkoxy, substituted alkoxy, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, acyl, acylamino, acyloxy, amino, substituted amino, aminoacyl, aminoacyloxy, oxyaminoacyl, azido, cyano, halogen, hydroxyl, oxo, thioketo, carboxyl, carboxylalkyl, thioaryloxy, thioheteroaryloxy, thioheterocyclooxy, thiol, thioalkoxy, substituted thioalkoxy, aryl, aryloxy, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, hydroxyamino, alkoxyamino, nitro, —SO-alkyl, —SO-substituted alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-substituted alkyl, —SO₂-aryl, and —SO₂-heteroaryl.

“Alkynyloxy” refers to the group —O-alkynyl, wherein alkynyl is as defined herein. Alkynyloxy includes, by way of example, ethynyloxy, propynyloxy, and the like.

“Acyl” refers to the groups H—C(O)—, alkyl-C(O)—, substituted alkyl-C(O)—, alkenyl-C(O)—, substituted alkenyl-C(O)—, alkynyl-C(O)—, substituted alkynyl-C(O)—, cycloalkyl-C(O)—, substituted cycloalkyl-C(O)—, cycloalkenyl-C(O)—, substituted cycloalkenyl-C(O)—, aryl-C(O)—, substituted aryl-C(O)—, heteroaryl-C(O)—, substituted heteroaryl-C(O)—, heterocyclyl-C(O)—, and substituted heterocyclyl-C(O)—, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein. For example, acyl includes the “acetyl” group CH₃C(O)—

“Acylamino” refers to the groups —NR²⁰C(O)alkyl, —NR²⁰C(O)substituted alkyl, NR²⁰C(O)cycloalkyl, —NR²⁰C(O)substituted cycloalkyl, —NR²⁰C(O)cycloalkenyl, —NR²⁰C(O)substituted cycloalkenyl, —NR²⁰C(O)alkenyl, —NR²⁰C(O)substituted alkenyl, —NR²⁰C(O)alkynyl, —NR²⁰C(O)substituted alkynyl, —NR²⁰C(O)aryl, —NR²⁰C(O)substituted aryl, —NR²⁰C(O)heteroaryl, —NR²⁰C(O)substituted heteroaryl, —NR²⁰C(O)heterocyclic, and —NR²⁰C(O)substituted heterocyclic, wherein R²⁰ is hydrogen or alkyl and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

“Aminocarbonyl” or the term “aminoacyl” refers to the group —C(O)NR²¹R²², herein R²¹ and R²² independently are selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R²¹ and R²² are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

“Aminocarbonylamino” refers to the group —NR²¹C(O)NR²²R²³ where R²¹, R²², and R²³ are independently selected from hydrogen, alkyl, aryl or cycloalkyl, or where two R groups are joined to form a heterocyclyl group.

The term “alkoxycarbonylamino” refers to the group —NRC(O)OR where each R is independently hydrogen, alkyl, substituted alkyl, aryl, heteroaryl, or heterocyclyl wherein alkyl, substituted alkyl, aryl, heteroaryl, and heterocyclyl are as defined herein.

The term “acyloxy” refers to the groups alkyl-C(O)O—, substituted alkyl-C(O)O—, cycloalkyl-C(O)O—, substituted cycloalkyl-C(O)O—, aryl-C(O)O—, heteroaryl-C(O)O—, and heterocyclyl-C(O)O— wherein alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, aryl, heteroaryl, and heterocyclyl are as defined herein.

“Aminosulfonyl” refers to the group —SO₂NR²¹R²², wherein R²¹ and R²² independently are selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, substituted heterocyclic and where R²¹ and R²² are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group and alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic and substituted heterocyclic are as defined herein.

“Sulfonylamino” refers to the group —NR²¹SO₂R²², wherein R²¹ and R²² independently are selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R²¹ and R²² are optionally joined together with the atoms bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

“Aryl” or “Ar” refers to a monovalent aromatic carbocyclic group of from 6 to 18 carbon atoms having a single ring (such as is present in a phenyl group) or a ring system having multiple condensed rings (examples of such aromatic ring systems include naphthyl, anthryl and indanyl) which condensed rings may or may not be aromatic, provided that the point of attachment is through an atom of an aromatic ring. This term includes, by way of example, phenyl and naphthyl. Unless otherwise constrained by the definition for the aryl substituent, such aryl groups can optionally be substituted with from 1 to 5 substituents, or from 1 to 3 substituents, selected from acyloxy, hydroxy, thiol, acyl, alkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, substituted alkyl, substituted alkoxy, substituted alkenyl, substituted alkynyl, substituted cycloalkyl, substituted cycloalkenyl, amino, substituted amino, aminoacyl, acylamino, alkaryl, aryl, aryloxy, azido, carboxyl, carboxylalkyl, cyano, halogen, nitro, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, aminoacyloxy, oxyacylamino, thioalkoxy, substituted thioalkoxy, thioaryloxy, thioheteroaryloxy, —SO-alkyl, —SO-substituted alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-substituted alkyl, —SO₂-aryl, —SO₂-heteroaryl and trihalomethyl.

“Aryloxy” refers to the group —O-aryl, wherein aryl is as defined herein, including, by way of example, phenoxy, naphthoxy, and the like, including optionally substituted aryl groups as also defined herein.

“Amino” refers to the group —NH₂.

The term “substituted amino” refers to the group —NRR where each R is independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, alkenyl, substituted alkenyl, cycloalkenyl, substituted cycloalkenyl, alkynyl, substituted alkynyl, aryl, heteroaryl, and heterocyclyl provided that at least one R is not hydrogen.

The term “azido” refers to the group —N₃.

“Carboxyl,” “carboxy” or “carboxylate” refers to —CO₂H or salts thereof.

“Carboxyl ester” or “carboxy ester” or the terms “carboxyalkyl” or “carboxylalkyl” refers to the groups —C(O)O-alkyl, —C(O)O-substituted alkyl, —C(O)O-alkenyl, —C(O)O-substituted alkenyl, —C(O)O-alkynyl, —C(O)O-substituted alkynyl, —C(O)O-aryl, —C(O)O-substituted aryl, —C(O)O-cycloalkyl, —C(O)O-substituted cycloalkyl, —C(O)O-cycloalkenyl, —C(O)O-substituted cycloalkenyl, —C(O)O-heteroaryl, —C(O)O-substituted heteroaryl, —C(O)O-heterocyclic, and —C(O)O-substituted heterocyclic, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

“(Carboxyl ester)oxy” or “carbonate” refers to the groups —O—C(O)O-alkyl, —O—C(O)O-substituted alkyl, —O—C(O)O-alkenyl, —O—C(O)O-substituted alkenyl, —O—C(O)O-alkynyl, —O—C(O)O-substituted alkynyl, —O—C(O)O-aryl, —O—C(O)O-substituted aryl, —O—C(O)O-cycloalkyl, —O—C(O)O-substituted cycloalkyl, —O—C(O)O-cycloalkenyl, —O—C(O)O-substituted cycloalkenyl, —O—C(O)O-heteroaryl, —O—C(O)O-substituted heteroaryl, —O—C(O)O-heterocyclic, and —O—C(O)O-substituted heterocyclic, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

“Cyano” or “nitrile” refers to the group —CN.

“Cycloalkyl” refers to cyclic alkyl groups of from 3 to 10 carbon atoms having single or multiple cyclic rings including fused, bridged, and spiro ring systems. Examples of suitable cycloalkyl groups include, for instance, adamantyl, cyclopropyl, cyclobutyl, cyclopentyl, cyclooctyl and the like. Such cycloalkyl groups include, by way of example, single ring structures such as cyclopropyl, cyclobutyl, cyclopentyl, cyclooctyl, and the like, or multiple ring structures such as adamantanyl, and the like.

The term “substituted cycloalkyl” refers to cycloalkyl groups having from 1 to 5 substituents, or from 1 to 3 substituents, selected from alkyl, substituted alkyl, alkoxy, substituted alkoxy, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, acyl, acylamino, acyloxy, amino, substituted amino, aminoacyl, aminoacyloxy, oxyaminoacyl, azido, cyano, halogen, hydroxyl, oxo, thioketo, carboxyl, carboxylalkyl, thioaryloxy, thioheteroaryloxy, thioheterocyclooxy, thiol, thioalkoxy, substituted thioalkoxy, aryl, aryloxy, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, hydroxyamino, alkoxyamino, nitro, —SO-alkyl, —SO-substituted alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-substituted alkyl, —SO₂-aryl and —SO₂-heteroaryl.

“Cycloalkenyl” refers to non-aromatic cyclic alkyl groups of from 3 to 10 carbon atoms having single or multiple rings and having at least one double bond and preferably from 1 to 2 double bonds.

The term “substituted cycloalkenyl” refers to cycloalkenyl groups having from 1 to 5 substituents, or from 1 to 3 substituents, selected from alkoxy, substituted alkoxy, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, acyl, acylamino, acyloxy, amino, substituted amino, aminoacyl, aminoacyloxy, oxyaminoacyl, azido, cyano, halogen, hydroxyl, keto, thioketo, carboxyl, carboxylalkyl, thioaryloxy, thioheteroaryloxy, thioheterocyclooxy, thiol, thioalkoxy, substituted thioalkoxy, aryl, aryloxy, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, hydroxyamino, alkoxyamino, nitro, —SO-alkyl, —SO-substituted alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-substituted alkyl, —SO₂-aryl and —SO₂-heteroaryl.

“Cycloalkynyl” refers to non-aromatic cycloalkyl groups of from 5 to 10 carbon atoms having single or multiple rings and having at least one triple bond.

“Cycloalkoxy” refers to —O-cycloalkyl.

“Cycloalkenyloxy” refers to —O-cycloalkenyl.

“Halo” or “halogen” refers to fluoro, chloro, bromo, and iodo.

“Hydroxy” or “hydroxyl” refers to the group —OH.

“Heteroaryl” refers to an aromatic group of from 1 to 15 carbon atoms, such as from 1 to 10 carbon atoms and 1 to 10 heteroatoms selected from the group consisting of oxygen, nitrogen, and sulfur within the ring. Such heteroaryl groups can have a single ring (such as, pyridinyl, imidazolyl or furyl) or multiple condensed rings in a ring system (for example as in groups such as, indolizinyl, quinolinyl, benzofuran, benzimidazolyl or benzothienyl), wherein at least one ring within the ring system is aromatic and at least one ring within the ring system is aromatic, provided that the point of attachment is through an atom of an aromatic ring. In certain embodiments, the nitrogen and/or sulfur ring atom(s) of the heteroaryl group are optionally oxidized to provide for the N-oxide (N→O), sulfinyl, or sulfonyl moieties. The term heteroaryl includes, by way of example, pyridinyl, pyrrolyl, indolyl, thienyl (thiophenyl), imidazolyl, furanyl, oxazolyl, thiazolyl, and the like. Unless otherwise constrained by the definition for the heteroaryl substituent, such heteroaryl groups can be optionally substituted with 1 to 5 substituents, or from 1 to 3 substituents, selected from acyloxy, hydroxy, thiol, acyl, alkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, substituted alkyl, substituted alkoxy, substituted alkenyl, substituted alkynyl, substituted cycloalkyl, substituted cycloalkenyl, amino, substituted amino, aminoacyl, acylamino, alkaryl, aryl, aryloxy, azido, carboxyl, carboxylalkyl, cyano, halogen, nitro, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, aminoacyloxy, oxyacylamino, thioalkoxy, substituted thioalkoxy, thioaryloxy, thioheteroaryloxy, —SO-alkyl, —SO-substituted alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-substituted alkyl, —SO₂-aryl and -SO₂-heteroaryl, and trihalomethyl.

The term “heteroaralkyl” refers to the groups -alkylene-heteroaryl where alkylene and heteroaryl are defined herein. This term includes, by way of example, pyridylmethyl, pyridylethyl, indolylmethyl, and the like.

“Heteroaryloxy” refers to —O-heteroaryl.

“Heterocycle,” “heterocyclic,” “heterocycloalkyl,” and “heterocyclyl” refer to a saturated or unsaturated group having a single ring or multiple condensed rings, including fused bridged and spiro ring systems, and having from 3 to 20 ring atoms, including 1 to 10 hetero atoms. These ring atoms are selected from the group consisting of nitrogen, sulfur, or oxygen, wherein, in fused ring systems, one or more of the rings can be cycloalkyl, aryl, or heteroaryl, provided that the point of attachment is through the non-aromatic ring. In certain embodiments, the nitrogen and/or sulfur atom(s) of the heterocyclic group are optionally oxidized to provide for the N-oxide, —S(O)—, or —SO₂— moieties.

Examples of heterocycles and heteroaryls include, but are not limited to, azetidine, pyrrole, imidazole, pyrazole, pyridine, pyrazine, pyrimidine, pyridazine, indolizine, isoindole, indole, dihydroindole, indazole, purine, quinolizine, isoquinoline, quinoline, phthalazine, naphthylpyridine, quinoxaline, quinazoline, cinnoline, pteridine, carbazole, carboline, phenanthridine, acridine, phenanthroline, isothiazole, phenazine, isoxazole, phenoxazine, phenothiazine, imidazolidine, imidazoline, piperidine, piperazine, indoline, phthalimide, 1,2,3,4-tetrahydroisoquinoline, 4,5,6,7-tetrahydrobenzo[b]thiophene, thiazole, thiazolidine, thiophene, benzo[b]thiophene, morpholinyl, thiomorpholinyl (also referred to as thiamorpholinyl), 1,1-dioxothiomorpholinyl, piperidinyl, pyrrolidine, tetrahydrofuranyl, and the like.

Unless otherwise constrained by the definition for the heterocyclic substituent, such heterocyclic groups can be optionally substituted with 1 to 5, or from 1 to 3 substituents, selected from alkoxy, substituted alkoxy, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, acyl, acylamino, acyloxy, amino, substituted amino, aminoacyl, aminoacyloxy, oxyaminoacyl, azido, cyano, halogen, hydroxyl, oxo, thioketo, carboxyl, carboxylalkyl, thioaryloxy, thioheteroaryloxy, thioheterocyclooxy, thiol, thioalkoxy, substituted thioalkoxy, aryl, aryloxy, heteroaryl, heteroaryloxy, heterocyclyl, heterocyclooxy, hydroxyamino, alkoxyamino, nitro, —SO-alkyl, —SO-substituted alkyl, —SO-aryl, —SO-heteroaryl, —SO₂-alkyl, —SO₂-substituted alkyl, —SO₂-aryl, —SO₂-heteroaryl, and fused heterocycle.

“Heterocyclyloxy” refers to the group —O-heterocyclyl.

The term “heterocyclylthio” refers to the group heterocyclic-S—.

The term “heterocyclene” refers to the diradical group formed from a heterocycle, as defined herein.

The term “hydroxyamino” refers to the group —NHOH.

“Nitro” refers to the group —NO₂.

“Oxo” refers to the atom (═O).

“Sulfonyl” refers to the group SO₂-alkyl, SO₂-substituted alkyl, SO₂-alkenyl, SO₂-substituted alkenyl, SO₂-cycloalkyl, SO₂-substituted cylcoalkyl, SO₂-cycloalkenyl, SO₂-substituted cylcoalkenyl, SO₂-aryl, SO₂-substituted aryl, SO₂-heteroaryl, SO₂-substituted heteroaryl, SO₂-heterocyclic, and SO₂-substituted heterocyclic, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein. Sulfonyl includes, by way of example, methyl-SO₂—, phenyl-SO₂—, and 4-methylphenyl-SO₂—.

“Sulfonyloxy” refers to the group —OSO₂-alkyl, OSO₂-substituted alkyl, OSO₂-alkenyl, OSO₂-substituted alkenyl, OSO₂-cycloalkyl, OSO₂-substituted cylcoalkyl, OSO₂-cycloalkenyl, OSO₂-substituted cylcoalkenyl, OSO₂-aryl, OSO₂-substituted aryl, OSO₂-heteroaryl, OSO₂-substituted heteroaryl, OSO₂-heterocyclic, and OSO₂ substituted heterocyclic, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

The term “aminocarbonyloxy” refers to the group —OC(O)NRR where each R is independently hydrogen, alkyl, substituted alkyl, aryl, heteroaryl, or heterocyclic wherein alkyl, substituted alkyl, aryl, heteroaryl and heterocyclic are as defined herein.

“Thiol” refers to the group —SH.

“Thioxo” or the term “thioketo” refers to the atom (═S).

“Alkylthio” or the term “thioalkoxy” refers to the group —S-alkyl, wherein alkyl is as defined herein. In certain embodiments, sulfur may be oxidized to —S(O)—. The sulfoxide may exist as one or more stereoisomers.

The term “substituted thioalkoxy” refers to the group —S-substituted alkyl.

The term “thioaryloxy” refers to the group aryl-S— wherein the aryl group is as defined herein including optionally substituted aryl groups also defined herein.

The term “thioheteroaryloxy” refers to the group heteroaryl-S— wherein the heteroaryl group is as defined herein including optionally substituted aryl groups as also defined herein.

The term “thioheterocyclooxy” refers to the group heterocyclyl-S— wherein the heterocyclyl group is as defined herein including optionally substituted heterocyclyl groups as also defined herein.

In addition to the disclosure herein, the term “substituted,” when used to modify a specified group or radical, can also mean that one or more hydrogen atoms of the specified group or radical are each, independently of one another, replaced with the same or different substituent groups as defined below.

In addition to the groups disclosed with respect to the individual terms herein, substituent groups for substituting for one or more hydrogens (any two hydrogens on a single carbon can be replaced with ═O, ═NR⁷⁰, ═N—OR⁷⁰, ═N₂ or ═S) on saturated carbon atoms in the specified group or radical are, unless otherwise specified, —R⁶⁰, halo, ═O, —OR⁷⁰, —SR⁷⁰, —NR⁸⁰R⁸⁰, trihalomethyl, —CN, —OCN, —SCN, —NO, —NO₂, ═N₂, —N₃, —SO₂R⁷⁰, —SO₂O⁻M⁺, —SO₂OR⁷⁰, —OSO₂R⁷⁰, —OSO₂O⁻M⁺, —OSO₂OR⁷⁰, —P(O)(O⁻)₂(M⁺)₂, —P(O)(OR⁷⁰)O⁻M⁺, —P(O)(OR⁷⁰)₂, —C(O)R⁷⁰, —C(S)R⁷⁰, —C(NR⁷⁰)R⁷⁰, —C(O)O⁻M⁺, —C(O)OR⁷⁰, —C(S)OR⁷⁰, —C(O)NR⁸⁰R⁸⁰, —C(NR⁷⁰)NR⁸⁰R⁸⁰, —OC(O)R⁷⁰, —OC(S)R⁷⁰, —OC(O)O⁻M⁺, —OC(O)OR⁷⁰, —OC(S)OR⁷⁰, —NR⁷⁰C(O)R⁷⁰, —NR⁷⁰C(S)R⁷⁰, —NR⁷⁰CO₂ ⁻M⁺, —NR⁷⁰CO₂R⁷⁰, —NR⁷⁰C(S)OR⁷⁰, —NR⁷⁰C(O)NR⁸⁰R⁸⁰, —NR⁷⁰C(NR⁷⁰)R⁷⁰ and —NR⁷⁰C(NR⁷⁰)NR⁸⁰R⁸⁰, where R⁶⁰ is selected from the group consisting of optionally substituted alkyl, cycloalkyl, heteroalkyl, heterocycloalkylalkyl, cycloalkylalkyl, aryl, arylalkyl, heteroaryl and heteroarylalkyl, each R⁷⁰ is independently hydrogen or R⁶⁰; each R⁸⁰ is independently R⁷⁰ or alternatively, two R^(80′)s, taken together with the nitrogen atom to which they are bonded, form a 5-, 6- or 7-membered heterocycloalkyl which may optionally include from 1 to 4 of the same or different additional heteroatoms selected from the group consisting of O, N and S, of which N may have —H or C₁-C₃ alkyl substitution; and each M⁺ is a counter ion with a net single positive charge. Each M⁺ may independently be, for example, an alkali ion, such as K⁺, Na⁺, Li⁺; an ammonium ion, such as ⁺N(R⁶⁰)₄; or an alkaline earth ion, such as [Ca²⁺]_(0.5), [Mg²⁺]_(0.5), or [Ba²⁺]_(0.5) (“subscript 0.5 means that one of the counter ions for such divalent alkali earth ions can be an ionized form of a compound of the invention and the other a typical counter ion such as chloride, or two ionized compounds disclosed herein can serve as counter ions for such divalent alkali earth ions, or a doubly ionized compound of the invention can serve as the counter ion for such divalent alkali earth ions). As specific examples, —NR⁸⁰R⁸⁰ is meant to include —NH₂, —NH-alkyl, N-pyrrolidinyl, N-piperazinyl, 4N-methyl-piperazin-1-yl and N-morpholinyl.

In addition to the disclosure herein, substituent groups for hydrogens on unsaturated carbon atoms in “substituted” alkene, alkyne, aryl and heteroaryl groups are, unless otherwise specified, —R⁶⁰, halo, —O⁻M⁺, —OR⁷⁰, —SR⁷⁰, —NR⁸⁰R⁸⁰, trihalomethyl, —CF₃, —CN, —OCN, —SCN, —NO, —NO₂, —N₃, —SO₂R⁷⁰, —SO₃ ⁻M⁺, —SO₃R⁷⁰, —OSO₂R⁷⁰, —OSO₃ ⁻M⁺, —OSO₃R⁷⁰, —PO₃ ⁻²(M⁺)₂, —P(O)(OR⁷⁰)O⁻M⁺, —P(O)(OR⁷⁰)₂, —C(O)R⁷⁰, —C(S)R⁷⁰, —C(NR⁷⁰)R⁷⁰, —CO₂ ⁻M⁺, —CO₂R⁷⁰, —C(S)OR⁷⁰, —C(O)NR⁸⁰R⁸⁰, —C(NR⁷⁰)NR⁸⁰R⁸⁰, —OC(O)R⁷⁰, —OC(S)R⁷⁰, —OCO₂ ⁻M⁺, —OCO₂R⁷⁰, —OC(S)OR⁷⁰, —NR⁷⁰C(O)R⁷⁰, —NR⁷⁰C(S)R⁷⁰, —NR⁷⁰CO₂ ⁻M⁺, —NR⁷⁰CO₂R⁷⁰, —NR⁷⁰C(S)OR⁷⁰, —NR⁷⁰C(O)NR⁸⁰R⁸⁰, —NR⁷⁰C(NR⁷⁰)R⁷⁰ and —NR⁷⁰C(NR⁷⁰)NR⁸⁰R⁸⁰, where R⁶⁰, R⁷⁰, R⁸⁰ and M⁺ are as previously defined, provided that in case of substituted alkene or alkyne, the substituents are not —O⁻M⁺, —OR⁷⁰, —SR⁷⁰, or —S⁻M⁺.

In addition to the groups disclosed with respect to the individual terms herein, substituent groups for hydrogens on nitrogen atoms in “substituted” heteroalkyl and cycloheteroalkyl groups are, unless otherwise specified, —R⁶⁰, —O⁻M⁺, —OR⁷⁰, —SR⁷⁰, —S⁻M⁺, —NR⁸⁰R⁸⁰, trihalomethyl, —CF₃, —CN, —NO, —NO₂, —S(O)₂R⁷⁰, —S(O)₂O⁻M⁺, —S(O)₂OR⁷⁰, —OS(O)₂R⁷⁰, —OS(O)₂O⁻M⁺, —OS(O)₂OR⁷⁰, —P(O)(O⁻)₂(M⁺)₂, —P(O)(OR⁷⁰)O⁻M⁺, —P(O)(OR⁷⁰)(OR⁷⁰), —C(O)R⁷⁰, —C(S)R⁷⁰, —C(NR⁷⁰)R⁷⁰, —C(O)OR⁷⁰, —C(S)OR⁷⁰, —C(O)NR⁸⁰ R⁸⁰, —C(NR⁷⁰)NR⁸⁰ R⁸⁰, —OC(O)R⁷⁰, —OC(S)R⁷⁰, —OC(O)OR⁷⁰, —OC(S)OR⁷⁰, —NR⁷⁰C(O)R⁷⁰, —NR⁷⁰C(S)R⁷⁰, —NR⁷⁰C(O)OR⁷⁰, —NR⁷⁰C(S)OR⁷⁰, —NR⁷⁰C(O)NR⁸⁰R⁸⁰, —NR⁷⁰C(NR⁷⁰)R⁷⁰ and —NR⁷⁰C(NR⁷⁰)NR⁸⁰R⁸⁰, where R⁶⁰, R⁷⁰, R⁸⁰ and M⁺ are as previously defined.

In addition to the disclosure herein, in a certain embodiment, a group that is substituted has 1, 2, 3, or 4 substituents, 1, 2, or 3 substituents, 1 or 2 substituents, or 1 substituent.

It is understood that in all substituted groups defined above, polymers arrived at by defining substituents with further substituents to themselves (e.g., substituted aryl having a substituted aryl group as a substituent which is itself substituted with a substituted aryl group, which is further substituted by a substituted aryl group, etc.) are not intended for inclusion herein. In such cases, the maximum number of such substitutions is three. For example, serial substitutions of substituted aryl groups specifically contemplated herein are limited to substituted aryl-(substituted aryl)-substituted aryl.

Unless indicated otherwise, the nomenclature of substituents that are not explicitly defined herein are arrived at by naming the terminal portion of the functionality followed by the adjacent functionality toward the point of attachment. For example, the substituent “arylalkyloxycarbonyl” refers to the group (aryl)-(alkyl)-O—C(O)—.

As to any of the groups disclosed herein which contain one or more substituents, it is understood, of course, that such groups do not contain any substitution or substitution patterns which are sterically impractical and/or synthetically non-feasible. In addition, the subject compounds include all stereochemical isomers arising from the substitution of these compounds.

The term “pharmaceutically acceptable salt” means a salt which is acceptable for administration to a patient, such as a mammal (salts with counterions having acceptable mammalian safety for a given dosage regime). Such salts can be derived from pharmaceutically acceptable inorganic or organic bases and from pharmaceutically acceptable inorganic or organic acids. “Pharmaceutically acceptable salt” refers to pharmaceutically acceptable salts of a compound, which salts are derived from a variety of organic and inorganic counter ions well known in the art and include, by way of example only, sodium, potassium, calcium, magnesium, ammonium, tetraalkylammonium, and the like; and when the molecule contains a basic functionality, salts of organic or inorganic acids, such as hydrochloride, hydrobromide, formate, tartrate, besylate, mesylate, acetate, maleate, oxalate, and the like.

The term “salt thereof” means a compound formed when a proton of an acid is replaced by a cation, such as a metal cation or an organic cation and the like. Where applicable, the salt is a pharmaceutically acceptable salt, although this is not required for salts of intermediate compounds that are not intended for administration to a patient. By way of example, salts of the present compounds include those wherein the compound is protonated by an inorganic or organic acid to form a cation, with the conjugate base of the inorganic or organic acid as the anionic component of the salt.

“Solvate” refers to a complex formed by combination of solvent molecules with molecules or ions of the solute. The solvent can be an organic compound, an inorganic compound, or a mixture of both. Some examples of solvents include, but are not limited to, methanol, N,N-dimethylformamide, tetrahydrofuran, dimethylsulfoxide, and water. When the solvent is water, the solvate formed is a hydrate.

“Stereoisomer” and “stereoisomers” refer to compounds that have same atomic connectivity but different atomic arrangement in space. Stereoisomers include cis-trans isomers, E and Z isomers, enantiomers, and diastereomers.

“Tautomer” refers to alternate forms of a molecule that differ only in electronic bonding of atoms and/or in the position of a proton, such as enol-keto and imine-enamine tautomers, or the tautomeric forms of heteroaryl groups containing a —N═C(H)—NH— ring atom arrangement, such as pyrazoles, imidazoles, benzimidazoles, triazoles, and tetrazoles. A person of ordinary skill in the art would recognize that other tautomeric ring atom arrangements are possible.

It will be appreciated that the term “or a salt or solvate or stereoisomer thereof” is intended to include all permutations of salts, solvates and stereoisomers, such as a solvate of a pharmaceutically acceptable salt of a stereoisomer of subject compound.

“Pharmaceutically effective amount” and “therapeutically effective amount” refer to an amount of a compound sufficient to treat a specified disorder or disease or one or more of its symptoms and/or to prevent the occurrence of the disease or disorder. In reference to tumorigenic proliferative disorders, a pharmaceutically or therapeutically effective amount comprises an amount sufficient to, among other things, cause the tumor to shrink or decrease the growth rate of the tumor.

“Patient” or “subject” refers to human and non-human subjects, especially mammalian subjects.

The term “treating” or “treatment” as used herein means the treating or treatment of a disease or medical condition in a patient, such as a mammal (particularly a human) that includes: (a) preventing the disease or medical condition from occurring, such as, prophylactic treatment of a subject; (b) ameliorating the disease or medical condition, such as, eliminating or causing regression of the disease or medical condition in a patient; (c) suppressing the disease or medical condition, for example by, slowing or arresting the development of the disease or medical condition in a patient; or (d) alleviating a symptom of the disease or medical condition in a patient.

The terms “polypeptide,” “peptide,” and “protein” are used interchangeably herein to refer to a polymeric form of amino acids of any length. Unless specifically indicated otherwise, “polypeptide,” “peptide,” and “protein” can include genetically coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones. The term includes fusion proteins, including, but not limited to, fusion proteins with a heterologous amino acid sequence, fusions with heterologous and homologous leader sequences, proteins which contain at least one N-terminal methionine residue (e.g., to facilitate production in a recombinant bacterial host cell); immunologically tagged proteins; and the like.

“Native amino acid sequence” or “parent amino acid sequence” are used interchangeably herein to refer to the amino acid sequence of a polypeptide prior to modification to include a modified amino acid residue.

The terms “amino acid analog,” “unnatural amino acid,” and the like may be used interchangeably, and include amino acid-like compounds that are similar in structure and/or overall shape to one or more amino acids commonly found in naturally occurring proteins (e.g., Ala or A, Cys or C, Asp or D, Glu or E, Phe or F, Gly or G, His or H, Ile or I, Lys or K, Leu or L, Met or M, Asn or N, Pro or P, Gln or Q, Arg or R, Ser or S, Thr or T, Val or V, Trp or W, Tyr or Y). Amino acid analogs also include natural amino acids with modified side chains or backbones. Amino acid analogs also include amino acid analogs with the same stereochemistry as in the naturally occurring D-form, as well as the L-form of amino acid analogs. In some instances, the amino acid analogs share backbone structures, and/or the side chain structures of one or more natural amino acids, with difference(s) being one or more modified groups in the molecule. Such modification may include, but is not limited to, substitution of an atom (such as N) for a related atom (such as S), addition of a group (such as methyl, or hydroxyl, etc.) or an atom (such as Cl or Br, etc.), deletion of a group, substitution of a covalent bond (single bond for double bond, etc.), or combinations thereof. For example, amino acid analogs may include α-hydroxy acids, and α-amino acids, and the like. Other amino acid analogs may include 2-formylglycine (FGly).

The term “carbohydrate” and the like may be used to refer to monomers units and/or polymers of monosaccharides, disaccharides, oligosaccharides, and polysaccharides. The term sugar may be used to refer to the smaller carbohydrates, such as monosaccharides, disaccharides. The term “carbohydrate derivative” includes compounds where one or more functional groups of a carbohydrate of interest are substituted (replaced by any convenient substituent), modified (converted to another group using any convenient chemistry) or absent (e.g., eliminated or replaced by H). A variety of carbohydrates and carbohydrate derivatives are available and may be adapted for use in the subject compounds and conjugates.

The term “antibody” is used in the broadest sense and includes monoclonal antibodies (including full length monoclonal antibodies), polyclonal antibodies, and multispecific antibodies (e.g., bispecific antibodies), humanized antibodies, single-chain antibodies, chimeric antibodies, antibody fragments (e.g., Fab fragments), and the like. An antibody is capable of binding a target antigen. (Janeway, C., Travers, P., Walport, M., Shlomchik (2001) Immuno Biology, 5th Ed., Garland Publishing, New York). A target antigen can have one or more binding sites, also called epitopes, recognized by complementarity determining regions (CDRs) formed by one or more variable regions of an antibody.

The term “natural antibody” refers to an antibody in which the heavy and light chains of the antibody have been made and paired by the immune system of a multi-cellular organism. Spleen, lymph nodes, bone marrow and serum are examples of tissues that produce natural antibodies. For example, the antibodies produced by the antibody producing cells isolated from a first animal immunized with an antigen are natural antibodies.

The term “humanized antibody” or “humanized immunoglobulin” refers to a non-human (e.g., mouse or rabbit) antibody containing one or more amino acids (in a framework region, a constant region or a CDR, for example) that have been substituted with a correspondingly positioned amino acid from a human antibody. In general, humanized antibodies produce a reduced immune response in a human host, as compared to a non-humanized version of the same antibody. Antibodies can be humanized using a variety of techniques known in the art including, for example, CDR-grafting (EP 239,400; PCT publication WO 91/09967; U.S. Pat. Nos. 5,225,539; 5,530,101; and 5,585,089), veneering or resurfacing (EP 592,106; EP 519,596; Padlan, Molecular Immunology 28(4/5):489-498 (1991); Studnicka et al., Protein Engineering 7(6):805-814 (1994); Roguska. et al., PNAS 91:969-973 (1994)), and chain shuffling (U.S. Pat. No. 5,565,332). In certain embodiments, framework substitutions are identified by modeling of the interactions of the CDR and framework residues to identify framework residues important for antigen binding and sequence comparison to identify unusual framework residues at particular positions (see, e.g., U.S. Pat. No. 5,585,089; Riechmann et al., Nature 332:323 (1988)). Additional methods for humanizing antibodies contemplated for use in the present invention are described in U.S. Pat. Nos. 5,750,078; 5,502,167; 5,705,154; 5,770,403; 5,698,417; 5,693,493; 5,558,864; 4,935,496; and 4,816,567, and PCT publications WO 98/45331 and WO 98/45332. In particular embodiments, a subject rabbit antibody may be humanized according to the methods set forth in US20040086979 and US20050033031. Accordingly, the antibodies described above may be humanized using methods that are well known in the art.

The term “chimeric antibodies” refer to antibodies whose light and heavy chain genes have been constructed, typically by genetic engineering, from antibody variable and constant region genes belonging to different species. For example, the variable segments of the genes from a mouse monoclonal antibody may be joined to human constant segments, such as gamma 1 and gamma 3. An example of a therapeutic chimeric antibody is a hybrid protein composed of the variable or antigen-binding domain from a mouse antibody and the constant or effector domain from a human antibody, although domains from other mammalian species may be used.

By “genetically-encodable” as used in reference to an amino acid sequence of polypeptide, peptide or protein means that the amino acid sequence is composed of amino acid residues that are capable of production by transcription and translation of a nucleic acid encoding the amino acid sequence, where transcription and/or translation may occur in a cell or in a cell-free in vitro transcription/translation system.

The term “control sequences” refers to DNA sequences that facilitate expression of an operably linked coding sequence in a particular expression system, e.g. mammalian cell, bacterial cell, cell-free synthesis, etc. The control sequences that are suitable for prokaryote systems, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cell systems may utilize promoters, polyadenylation signals, and enhancers.

A nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate the initiation of translation. Generally, “operably linked” means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and in reading frame. Linking is accomplished by ligation or through amplification reactions. Synthetic oligonucleotide adaptors or linkers may be used for linking sequences in accordance with conventional practice.

The term “expression cassette” as used herein refers to a segment of nucleic acid, usually DNA, that can be inserted into a nucleic acid (e.g., by use of restriction sites compatible with ligation into a construct of interest or by homologous recombination into a construct of interest or into a host cell genome). In general, the nucleic acid segment comprises a polynucleotide that encodes a polypeptide of interest, and the cassette and restriction sites are designed to facilitate insertion of the cassette in the proper reading frame for transcription and translation. Expression cassettes can also comprise elements that facilitate expression of a polynucleotide encoding a polypeptide of interest in a host cell. These elements may include, but are not limited to: a promoter, a minimal promoter, an enhancer, a response element, a terminator sequence, a polyadenylation sequence, and the like.

As used herein the term “isolated” is meant to describe a compound of interest that is in an environment different from that in which the compound naturally occurs. “Isolated” is meant to include compounds that are within samples that are substantially enriched for the compound of interest and/or in which the compound of interest is partially or substantially purified.

As used herein, the term “substantially purified” refers to a compound that is removed from its natural environment and is at least 60% free, at least 75% free, at least 80% free, at least 85% free, at least 90% free, at least 95% free, at least 98% free, or more than 98% free, from other components with which it is naturally associated.

The term “physiological conditions” is meant to encompass those conditions compatible with living cells, e.g., predominantly aqueous conditions of a temperature, pH, salinity, etc. that are compatible with living cells.

By “reactive partner” is meant a molecule or molecular moiety that specifically reacts with another reactive partner to produce a reaction product. Exemplary reactive partners include a cysteine or serine of a sulfatase motif and Formylglycine Generating Enzyme (FGE), which react to form a reaction product of a converted aldehyde tag containing a formylglycine (fGly) in lieu of cysteine or serine in the motif. Other exemplary reactive partners include an aldehyde of an fGly residue of a converted aldehyde tag (e.g., a reactive aldehyde group) and an “aldehyde-reactive reactive partner”, which comprises an aldehyde-reactive group and a moiety of interest, and which reacts to form a reaction product of a modified aldehyde tagged polypeptide having the moiety of interest conjugated to the modified polypeptide through a modified fGly residue.

“N-terminus” refers to the terminal amino acid residue of a polypeptide having a free amine group, which amine group in non-N-terminus amino acid residues normally forms part of the covalent backbone of the polypeptide.

“C-terminus” refers to the terminal amino acid residue of a polypeptide having a free carboxyl group, which carboxyl group in non-C-terminus amino acid residues normally forms part of the covalent backbone of the polypeptide.

By “internal site” as used in referenced to a polypeptide or an amino acid sequence of a polypeptide means a region of the polypeptide that is not at the N-terminus or at the C-terminus.

Before the present invention is further described, it is to be understood that this invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.

Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range, is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges, and are also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.

It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination. All combinations of the embodiments pertaining to the invention are specifically embraced by the present invention and are disclosed herein just as if each and every combination was individually and explicitly disclosed, to the extent that such combinations embrace subject matter that are, for example, compounds that are stable compounds (i.e., compounds that can be made, isolated, characterized, and tested for biological activity). In addition, all sub-combinations of the various embodiments and elements thereof (e.g., elements of the chemical groups listed in the embodiments describing such variables) are also specifically embraced by the present invention and are disclosed herein just as if each and every such sub-combination was individually and explicitly disclosed herein.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited.

It must be noted that as used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. It is further noted that the claims may be drafted to exclude any optional element. As such, this statement is intended to serve as antecedent basis for use of such exclusive terminology as “solely,” “only” and the like in connection with the recitation of claim elements, or use of a “negative” limitation.

It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination.

The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided may be different from the actual publication dates which may need to be independently confirmed.

DETAILED DESCRIPTION

The present disclosure provides conjugates (e.g., polypeptide conjugates), hydrazinyl-substituted heteroaryl compounds for producing the conjugates and methods of making and using the same. Embodiments of each are described in more detail in the sections below.

Conjugates

The present disclosure provides conjugates. By “conjugate” is meant a first moiety that is stably associated with a second moiety. By “stably associated” is meant that a moiety is bound to another moiety or structure under standard conditions. In certain embodiments, the two moieties are bound to each other through one or more covalent bonds.

In certain embodiments, the conjugate is a polypeptide conjugate, which includes a polypeptide conjugated to a second moiety. As described in more detail below, the moiety conjugated to the polypeptide can be any of a variety of moieties such as, but not limited to, a detectable label, a drug, a water-soluble polymer, or a moiety for immobilization of the polypeptide to a membrane or a surface. The moiety of interest can be conjugated to the polypeptide at any desired site of the polypeptide. Thus, the present disclosure provides, for example, a modified polypeptide having a moiety conjugated at a site at or near the C-terminus of the polypeptide. Other examples include a modified polypeptide having a moiety conjugated at a position at or near the N-terminus of the polypeptide. Examples also include a modified polypeptide having a moiety conjugated at a position between the C-terminus and the N-terminus of the polypeptide (e.g., at an internal site of the polypeptide). Combinations of the above are also possible where the modified polypeptide is conjugated to two or more moieties.

Embodiments of the present disclosure include conjugates where a polypeptide is conjugated to one or more moieties, such as 2 moieties, 3 moieties, 4 moieties, 5 moieties, 6 moieties, 7 moieties, 8 moieties, 9 moieties, or 10 or more moieties. The moieties may be conjugated to the polypeptide at one or more sites in the polypeptide. For example, one or more moieties may be conjugated to a single amino acid residue of the polypeptide. In some cases, one moiety is conjugated to an amino acid residue of the polypeptide. In other embodiments, two moieties may be conjugated to the same amino acid residue of the polypeptide. In other embodiments, a first moiety is conjugated to a first amino acid residue of the polypeptide and a second moiety is conjugated to a second amino acid residue of the polypeptide. Combinations of the above are also possible, for example where a polypeptide is conjugated to a first moiety at a first amino acid residue and conjugated to two other moieties at a second amino acid residue. Other combinations are also possible, such as, but not limited to, a polypeptide conjugated to first and second moieties at a first amino acid residue and conjugated to third and fourth moieties at a second amino acid residue, etc.

The one or more amino acid residues that are conjugated to the one or more moieties may be naturally occurring amino acids, unnatural amino acids, or combinations thereof. For instance, the conjugate may include a moiety conjugated to a naturally occurring amino acid residue of the polypeptide. In other instances, the conjugate may include a moiety conjugated to an unnatural amino acid residue of the polypeptide. One or more moieties may be conjugated to the polypeptide at a single natural or unnatural amino acid residue as described above. One or more natural or unnatural amino acid residues in the polypeptide may be conjugated to the moiety or moieties as described herein. For example, two (or more) amino acid residues (e.g., natural or unnatural amino acid residues) in the polypeptide may each be conjugated to one or two moieties, such that multiple sites in the polypeptide are modified.

As described above, one or more moieties may be conjugated to the polypeptide to form a moiety-polypeptide conjugate. As such, in a mixture of conjugates (e.g., a reaction mixture from a conjugation reaction) the ratio of the moiety to polypeptide in the conjugate can range from 0.1 to 100, such as 0.1 to 75, or 0.1 to 50, or 0.1 to 25, or 0.1 to 10, or 0.5 to 10, or 0.5 to 9, or 0.5 to 8, or 0.5 to 7, or 0.5 to 6, or 0.5 to 5, or 0.5 to 4, or 0.5 to 3, or 0.5 to 2. For instance, the ratio of the moiety to polypeptide in the conjugate can range from 1 to 2. In some cases, the ratio of the moiety to polypeptide in the conjugate is 1 or 1.5. In some cases, the ratio of the moiety to polypeptide in the conjugate is measured as an average ratio from a mixture of conjugates. The term “average” as used herein refers to the arithmetic mean. In certain embodiments, the moiety conjugated to the polypeptide may be a chemical entity, such as, but not limited to, a drug or a detectable label. For example, the moiety conjugated to the polypeptide may be a drug. In addition, in some cases, the polypeptide may be an antibody. Thus, the conjugate can be an antibody-drug conjugate. In some embodiments, in a mixture of conjugates (e.g., a reaction mixture from a conjugation reaction) the ratio of the drug to antibody in the conjugate (Drug-to-Antibody Ratio; DAR) can range from 0.1 to 100, such as 0.1 to 75, or 0.1 to 50, or 0.1 to 25, or 0.1 to 10, or 0.5 to 10, or 0.5 to 9, or 0.5 to 8, or 0.5 to 7, or 0.5 to 6, or 0.5 to 5, or 0.5 to 4, or 0.5 to 3, or 0.5 to 2. For instance, the DAR can range from 1 to 2. In some cases, the DAR is 1 or 1.5. In some cases, the DAR is measured as an average ratio from a mixture of conjugates.

Although described herein in terms of a polypeptide conjugated to one or more moieties (e.g., a chemical entity, a polypeptide, etc.), embodiments of the present disclosure also include conjugates where a moiety (e.g., a chemical entity, such as a drug or a detectable label) is conjugated to one or more other moieties (e.g., a chemical entity, a polypeptide, etc.). For example, a drug may be conjugated to one or more other moieties (e.g., a chemical entity, a polypeptide, etc.), or in other embodiments, a detectable label may be conjugated to one or more other moieties (e.g., a chemical entity, a polypeptide, etc.). Thus, for instance, embodiments of the present disclosure include, but are not limited to, the following: a conjugate of a polypeptide and a drug; a conjugate of a polypeptide and a detectable label; a conjugate of two or more polypeptides; a conjugate of two or more drugs; a conjugate of two of more detectable labels; a conjugate of a drug and a detectable label; a conjugate of a polypeptide, a drug and a detectable label; a conjugate of a polypeptide and two or more drugs; a conjugate of a polypeptide and two or more detectable labels; a conjugate of a drug and two or more polypeptides; a conjugate of a detectable label and two or more polypeptides; and the like.

In certain embodiments, the polypeptide and the moiety of interest are conjugated through a coupling moiety. For example, the polypeptide and the moiety of interest may each be bound (e.g., covalently bonded) to the coupling moiety, thus indirectly binding the polypeptide and the moiety of interest together through the coupling moiety. In some cases, the coupling moiety includes a hydrazinyl-substituted heteroaryl compound or a derivative of a hydrazinyl-substituted heteroaryl compound. In some instances, the heteroaryl group of the hydrazinyl-substituted heteroaryl compound is a 5-membered heteroaryl ring, where one or more atoms in the ring is other than carbon (such as N, O or S). For instance, one atom in the ring may be N, O or S. In other instances, two atoms in the ring are N, or one atom is N and another atom is O, or one atom is N and another atom is S, or one atom is O and another atom is S. Examples of heteroaryl groups that can be included in the hydrazinyl-substituted heteroaryl coupling moiety include, but are not limited to, imidazolyl, pyrrolyl, furanyl, thienyl (thiophenyl), and the like. For example, in certain embodiments the heteroaryl group of the hydrazinyl-substituted heteroaryl coupling moiety is imidazolyl. In other embodiments, the the heteroaryl group of the hydrazinyl-substituted heteroaryl coupling moiety is pyrrolyl. In other embodiments, the the heteroaryl group of the hydrazinyl-substituted heteroaryl coupling moiety is furanyl.

A general scheme for coupling a moiety of interest to a polypeptide through a hydrazinyl-substituted heteroaryl coupling moiety is shown in general reaction scheme A below.

In general reaction scheme A, R may be the moiety of interest conjugated to the polypeptide. As described herein, the moiety of interest can be any of a variety of moieties such as, but not limited to, a chemical entity, such as a detectable label, a drug, a water-soluble polymer, or a moiety for immobilization of the polypeptide to a membrane or a surface of a substrate. R′ and R″ may each independently be any desired substituent, such as, but not limited to, hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. Z′ may be selected from CR²⁰, C, N, O and S. Z″ may be C or N. In some embodiments, one or both of Z′ and Z″ is not a carbon. R²⁰ may be any desired substituent, such as hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, or substituted heterocyclyl.

Other coupling schemes involving a hydrazinyl-substituted heteroaryl coupling moiety are also possible. For example, another general scheme for coupling a moiety of interest to a polypeptide through a hydrazinyl-substituted heteroaryl coupling moiety is shown in the general reaction scheme B below.

In general reaction scheme B, R may be the moiety of interest conjugated to the polypeptide. As described herein, the moiety can be any of a variety of moieties such as, but not limited to, a chemical entity, such as a detectable label, a drug, a water-soluble polymer, or a moiety for immobilization of the polypeptide to a membrane or a surface of a substrate. R′ and R″ may each independently be any desired substituent, such as, but not limited to, hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. Z′ may be selected from CR²⁰, C, N, O and S. Z″ may be C or N. In some embodiments, one or both of Z′ and Z″ is not a carbon. R²⁰ may be any desired substituent, such as hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, or substituted heterocyclyl.

Examples of coupling moieties are shown in the conjugates and compounds described in more detail below.

For example, aspects of the present disclosure include a conjugate (e.g., a polypeptide conjugate) that includes at least one modified amino acid residue of formula (I):

wherein:

R¹ and R² are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

one of R³ and R⁴ is a polypeptide and the other is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl;

Z¹ is selected from CR⁷, N, O and S;

Z² is C or N;

R⁷ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

L is a linker; and

W is a drug or a detectable label.

In certain embodiments, R¹ and R² are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments, R¹ is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R¹ is hydrogen. In certain embodiments, R¹ is alkyl or substituted alkyl. For instance, R¹ may be alkyl, such as C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R¹ is alkenyl or substituted alkenyl. In certain embodiments, R¹ is alkynyl or substituted alkynyl. In certain embodiments, R¹ is aryl or substituted aryl. In certain embodiments, R¹ is heteroaryl or substituted heteroaryl. In certain embodiments, R¹ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R¹ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R² is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R² is hydrogen. In certain embodiments, R² is alkyl or substituted alkyl. For instance, R² may be alkyl, such as C₁₋₁₀ alkyl, or C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R² is alkenyl or substituted alkenyl. In certain embodiments, R² is alkynyl or substituted alkynyl. In certain embodiments, R² is aryl or substituted aryl. In certain embodiments, R² is heteroaryl or substituted heteroaryl. In certain embodiments, R² is cycloalkyl or substituted cycloalkyl. In certain embodiments, R² is heterocyclyl or substituted heterocyclyl.

In certain embodiments, one of R³ and R⁴ is a polypeptide and the other is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments, R³ is the polypeptide. In these instances, R⁴ may be selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. For example, R⁴ may be hydrogen. In other embodiments, R⁴ is alkyl or substituted alkyl. For instance, R⁴ may be alkyl, such as C₁₋₁₀ alkyl, or C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R⁴ is alkenyl or substituted alkenyl. In certain embodiments, R⁴ is alkynyl or substituted alkynyl. In certain embodiments, R⁴ is aryl or substituted aryl. In certain embodiments, R⁴ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁴ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁴ is heterocyclyl or substituted heterocyclyl.

In other embodiments, R⁴ is the polypeptide. In these instances, R³ may be selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. For example, R³ may be hydrogen. In other embodiments, R³ is alkyl or substituted alkyl. For instance, R³ may be alkyl, such as C₁₋₁₀ alkyl, or C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R³ is alkenyl or substituted alkenyl. In certain embodiments, R³ is alkynyl or substituted alkynyl. In certain embodiments, R³ is aryl or substituted aryl. In certain embodiments, R³ is heteroaryl or substituted heteroaryl. In certain embodiments, R³ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R³ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl.

In certain embodiments, R⁵ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R⁵ is hydrogen. In certain embodiments, R⁵ is alkyl or substituted alkyl. In certain embodiments, R⁵ is alkenyl or substituted alkenyl. In certain embodiments, R⁵ is alkynyl or substituted alkynyl. In certain embodiments, R⁵ is alkoxy or substituted alkoxy. In certain embodiments, R⁵ is amino or substituted amino. In certain embodiments, R⁵ is carboxyl or carboxyl ester. In certain embodiments, R⁵ is acyl or acyloxy. In certain embodiments, R⁵ is acyl amino or amino acyl. In certain embodiments, R⁵ is alkylamide or substituted alkylamide. In certain embodiments, R⁵ is sulfonyl. In certain embodiments, R⁵ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R⁵ is aryl or substituted aryl. In certain embodiments, R⁵ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁵ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁵ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R⁵ is alkyl or substituted alkyl. For example, R⁵ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In some cases, R⁵ is methyl, ethyl, n-propyl, iso-propyl, n-butyl, sec-butyl, isobutyl, t-butyl, or the like. In certain cases, R⁵ is methyl.

In certain embodiments, R⁶ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R⁶ is hydrogen. In certain embodiments, R⁶ is alkyl or substituted alkyl. In certain embodiments, R⁶ is alkenyl or substituted alkenyl. In certain embodiments, R⁶ is alkynyl or substituted alkynyl. In certain embodiments, R⁶ is alkoxy or substituted alkoxy. In certain embodiments, R⁶ is amino or substituted amino. In certain embodiments, R⁶ is carboxyl or carboxyl ester. In certain embodiments, R⁶ is acyl or acyloxy. In certain embodiments, R⁶ is acyl amino or amino acyl. In certain embodiments, R⁶ is alkylamide or substituted alkylamide. In certain embodiments, R⁶ is sulfonyl. In certain embodiments, R⁶ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R⁶ is aryl or substituted aryl. In certain embodiments, R⁶ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁶ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁶ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R⁶ is alkyl or substituted alkyl. For example, R⁶ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In some cases, R⁶ is methyl, ethyl, n-propyl, iso-propyl, n-butyl, sec-butyl, isobutyl, t-butyl, or the like. In certain cases, R⁶ is methyl.

In certain embodiments, R⁵ and R⁶ are each independently selected from alkyl and substituted alkyl. For example, R⁵ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl), and R⁶ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁-C₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In some cases, R⁵ and R⁶ are each independently selected from methyl, ethyl, n-propyl, iso-propyl, n-butyl, sec-butyl, isobutyl, t-butyl, or the like. In certain cases, R⁵ and R⁶ are each methyl.

In certain embodiments, R⁵ and R⁶ are optionally cyclically linked to form a 5 or 6-membered heterocyclyl. In some instances, R⁵ and R⁶ (together with the atoms to which they are attached) may be cyclically linked to form a 5-membered heterocyclyl. In some instances, R⁵ and R⁶ (together with the atoms to which they are attached) may be cyclically linked to form a 6-membered heterocyclyl.

In certain embodiments, Z¹ is selected from CR⁷, N, O and S. In certain embodiments, Z¹ is CR⁷. In certain embodiments, Z¹ is N. In certain embodiments, Z¹ is O. In certain embodiments, Z¹ is S.

In certain embodiments, Z² is C or N. In certain embodiments, Z² is C. In certain embodiments, Z² is N.

Various combinations of Z¹ and Z² are possible. For example, in certain embodiments, Z¹ and Z² are each N. In other instances, Z¹ is CR⁷ and Z² is N. In other embodiments, Z¹ is O and Z² is C.

In certain embodiments, R⁷ (if present) is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R⁷ is hydrogen. In certain embodiments, R⁷ is alkyl or substituted alkyl. For example, R⁷ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In certain embodiments, R⁷ is alkenyl or substituted alkenyl. In certain embodiments, R⁷ is alkynyl or substituted alkynyl. In certain embodiments, R⁷ is alkoxy or substituted alkoxy. In certain embodiments, R⁷ is amino or substituted amino. In certain embodiments, R⁷ is carboxyl or carboxyl ester. In certain embodiments, R⁷ is acyl or acyloxy. In certain embodiments, R⁷ is acyl amino or amino acyl. In certain embodiments, R⁷ is alkylamide or substituted alkylamide. In certain embodiments, R⁷ is sulfonyl. In certain embodiments, R⁷ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R⁷ is aryl or substituted aryl. In certain embodiments, R⁷ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁷ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁷ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, L is a linker. Further description of the linker, L, is found in the disclosure herein. For example, in some embodiments, L is a linker comprising -(L¹)_(a)-(L²)_(b)-(L³)_(c)-(L⁴)_(d)-(L⁵)_(e)-, where L¹, L², L³, L⁴ and L⁵ are each a linker unit, and a, b, c, d and e are each independently 0 or 1, wherein the sum of a, b, c, d and e is 1 to 5, as described in more detail below. Other linkers are also possible, as shown described in more detail below.

In certain embodiments, W is a chemical entity, such as a drug or a detectable label. In certain embodiments, W is a drug. In certain embodiments, W is a detectable label.

In certain embodiments, the conjugate (e.g., polypeptide conjugate) includes at least one modified amino acid residue of formula (Ia):

In certain embodiments, the substituents for formula (Ia) are the same as those described for formula (I) herein. For example, in certain embodiments, the substituents for formula (Ia), e.g., R¹, R², R³, R⁴, R⁵, R⁶, L and W, are as defined in formula (I) herein. In certain embodiments of formula (Ia), R³ is the polypeptide.

In certain embodiments, the conjugate (e.g., polypeptide conjugate) includes at least one modified amino acid residue of formula (Ib):

In certain embodiments, the substituents for formula (Ib) are the same as those described for formula (I) herein. For example, in certain embodiments, the substituents for formula (Ib), e.g., R¹, R², R³, R⁴, R⁵, R⁶, R⁷, L and W, are as defined in formula (I) herein. In certain embodiments of formula (Ib), R³ is the polypeptide.

In certain embodiments, the conjugate (e.g., polypeptide conjugate) includes at least one modified amino acid residue of formula (Ic):

In certain embodiments, the substituents for formula (Ic) are the same as those described for formula (I) herein. For example, in certain embodiments, the substituents for formula (Ic), e.g., R¹, R², R³, R⁴, R⁵, R⁶, L and W, are as defined in formula (I) herein. In certain embodiments of formula (Ic), R⁴ is the polypeptide.

In certain embodiments, the conjugate (e.g., polypeptide conjugate) includes at least one modified amino acid residue having the following structure:

where L and W are as defined in formula (I) herein, and R³ is the polypeptide.

In certain embodiments, the conjugate (e.g., polypeptide conjugate) includes at least one modified amino acid residue having the following structure:

where L and W are as defined in formula (I) herein, and R³ is the polypeptide.

In certain embodiments, the conjugate (e.g., polypeptide conjugate) includes at least one modified amino acid residue having the following structure:

where L and W are as defined in formula (I) herein, and R⁴ is the polypeptide.

Compounds Useful for Producing Conjugates

The present disclosure provides hydrazinyl-substituted heteroaryl compounds and derivatives thereof useful for producing the conjugates (e.g., polypeptide conjugates) described herein. In certain embodiments, the hydrazinyl-substituted heteroaryl compound (or derivative thereof) may be a coupling moiety useful for conjugation of a polypeptide to a moiety of interest (e.g., a chemical entity such as, but not limited to, a drug or detectable label). For example, the hydrazinyl-substituted heteroaryl coupling moiety may be bound (e.g., attached through one or more covalent bonds) to the moiety of interest (e.g., drug or detectable label). Reaction between the hydrazinyl-substituted heteroaryl coupling moiety and a reactive group of the polypeptide (e.g., an fGly residue, such as a reactive aldehyde functional group of the fGly) can form one or more covalent bonds between the hydrazinyl-substituted heteroaryl coupling moiety and the polypeptide, thus attaching the polypeptide and the moiety of interest together through the coupling moiety.

In certain embodiments, the hydrazinyl-substituted heteroaryl compound is a compound of formula (II):

wherein:

R⁸ is H and R⁹ is —(CR²R⁴)(NR⁶)(NHR⁵), or R⁸ is —(CR¹R³)(NR⁵)(NHR⁶) and R⁹ is H;

R¹, R², R³ and R⁴ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl;

Z¹ is selected from CR⁷, N, O and S;

Z² is C or N;

R⁷ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl;

L is a linker; and

W is a drug or a detectable label.

In certain embodiments, R⁸ is H and R⁹ is —(CR₂R⁴)(NR⁶)(NHR⁵), or R⁸ is —(CR¹R³)(NR⁵)(NHR⁶) and R⁹ is H. In certain embodiments, R⁸ is H and R⁹ is —(CR²R⁴)(NR⁶)(NHR⁵). In certain embodiments R⁸ is —(CR¹R³)(NR⁵)(NHR⁶) and R⁹ is H.

In certain embodiments, R¹, R², R³ and R⁴ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments, R¹ is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R¹ is hydrogen. In certain embodiments, R¹ is alkyl or substituted alkyl. For instance, R¹ may be alkyl, such as C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R¹ is alkenyl or substituted alkenyl. In certain embodiments, R¹ is alkynyl or substituted alkynyl. In certain embodiments, R¹ is aryl or substituted aryl. In certain embodiments, R¹ is heteroaryl or substituted heteroaryl. In certain embodiments, R¹ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R¹ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R² is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R² is hydrogen. In certain embodiments, R² is alkyl or substituted alkyl. For instance, R² may be alkyl, such as C₁₋₁₀ alkyl, or C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R² is alkenyl or substituted alkenyl. In certain embodiments, R² is alkynyl or substituted alkynyl. In certain embodiments, R² is aryl or substituted aryl. In certain embodiments, R² is heteroaryl or substituted heteroaryl. In certain embodiments, R² is cycloalkyl or substituted cycloalkyl. In certain embodiments, R² is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R³ is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. For example, R³ may be hydrogen. In other embodiments, R³ is alkyl or substituted alkyl. For instance, R³ may be alkyl, such as C₁₋₁₀ alkyl, or C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R³ is alkenyl or substituted alkenyl. In certain embodiments, R³ is alkynyl or substituted alkynyl. In certain embodiments, R³ is aryl or substituted aryl. In certain embodiments, R³ is heteroaryl or substituted heteroaryl. In certain embodiments, R³ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R³ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R⁴ is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. For example, R⁴ may be hydrogen. In other embodiments, R⁴ is alkyl or substituted alkyl. For instance, R⁴ may be alkyl, such as C₁₋₁₀ alkyl, or C₁-C₆ alkyl, or C₁-C₃ alkyl (e.g., methyl). In certain embodiments, R⁴ is alkenyl or substituted alkenyl. In certain embodiments, R⁴ is alkynyl or substituted alkynyl. In certain embodiments, R⁴ is aryl or substituted aryl. In certain embodiments, R⁴ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁴ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁴ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl.

In certain embodiments, R⁵ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R⁵ is hydrogen. In certain embodiments, R⁵ is alkyl or substituted alkyl. In certain embodiments, R⁵ is alkenyl or substituted alkenyl. In certain embodiments, R⁵ is alkynyl or substituted alkynyl. In certain embodiments, R⁵ is alkoxy or substituted alkoxy. In certain embodiments, R⁵ is amino or substituted amino. In certain embodiments, R⁵ is carboxyl or carboxyl ester. In certain embodiments, R⁵ is acyl or acyloxy. In certain embodiments, R⁵ is acyl amino or amino acyl. In certain embodiments, R⁵ is alkylamide or substituted alkylamide. In certain embodiments, R⁵ is sulfonyl. In certain embodiments, R⁵ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R⁵ is aryl or substituted aryl. In certain embodiments, R⁵ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁵ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁵ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R⁵ is alkyl or substituted alkyl. For example, R⁵ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In some cases, R⁵ is methyl, ethyl, n-propyl, iso-propyl, n-butyl, sec-butyl, isobutyl, t-butyl, or the like. In certain cases, R⁵ is methyl.

In certain embodiments, R⁶ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R⁶ is hydrogen. In certain embodiments, R⁶ is alkyl or substituted alkyl. In certain embodiments, R⁶ is alkenyl or substituted alkenyl. In certain embodiments, R⁶ is alkynyl or substituted alkynyl. In certain embodiments, R⁶ is alkoxy or substituted alkoxy. In certain embodiments, R⁶ is amino or substituted amino. In certain embodiments, R⁶ is carboxyl or carboxyl ester. In certain embodiments, R⁶ is acyl or acyloxy. In certain embodiments, R⁶ is acyl amino or amino acyl. In certain embodiments, R⁶ is alkylamide or substituted alkylamide. In certain embodiments, R⁶ is sulfonyl. In certain embodiments, R⁶ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R⁶ is aryl or substituted aryl. In certain embodiments, R⁶ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁶ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁶ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, R⁶ is alkyl or substituted alkyl. For example, R⁶ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In some cases, R⁶ is methyl, ethyl, n-propyl, iso-propyl, n-butyl, sec-butyl, isobutyl, t-butyl, or the like. In certain cases, R⁶ is methyl.

In certain embodiments, R⁵ and R⁶ are each independently selected from alkyl and substituted alkyl. For example, R⁵ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl), and R⁶ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁-C₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In some cases, R⁵ and R⁶ are each independently selected from methyl, ethyl, n-propyl, iso-propyl, n-butyl, sec-butyl, isobutyl, t-butyl, or the like. In certain cases, R⁵ and R⁶ are each methyl.

In certain embodiments, R⁵ and R⁶ are optionally cyclically linked to form a 5 or 6-membered heterocyclyl. In some instances, R⁵ and R⁶ (together with the atoms to which they are attached) may be cyclically linked to form a 5-membered heterocyclyl. In some instances, R⁵ and R⁶ (together with the atoms to which they are attached) may be cyclically linked to form a 6-membered heterocyclyl.

In certain embodiments, Z¹ is selected from CR⁷, N, O and S. In certain embodiments, Z¹ is CR⁷. In certain embodiments, Z¹ is N. In certain embodiments, Z¹ is O. In certain embodiments, Z¹ is S.

In certain embodiments, Z² is C or N. In certain embodiments, Z² is C. In certain embodiments, Z² is N.

Various combinations of Z¹ and Z² are possible. For example, in certain embodiments, Z¹ and Z² are each N. In other instances, Z¹ is CR⁷ and Z² is N. In other embodiments, Z¹ is O and Z² is C.

In certain embodiments, R⁷ (if present) is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, R⁷ is hydrogen. In certain embodiments, R⁷ is alkyl or substituted alkyl. For example, R⁷ may be alkyl or substituted alkyl, such as, C₁₋₁₀ alkyl or C₁₋₁₀ substituted alkyl (e.g., C₁-C₆ alkyl or C₁-C₆ substituted alkyl). In certain embodiments, R⁷ is alkenyl or substituted alkenyl. In certain embodiments, R⁷ is alkynyl or substituted alkynyl. In certain embodiments, R⁷ is alkoxy or substituted alkoxy. In certain embodiments, R⁷ is amino or substituted amino. In certain embodiments, R⁷ is carboxyl or carboxyl ester. In certain embodiments, R⁷ is acyl or acyloxy. In certain embodiments, R⁷ is acyl amino or amino acyl. In certain embodiments, R⁷ is alkylamide or substituted alkylamide. In certain embodiments, R⁷ is sulfonyl. In certain embodiments, R⁷ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R⁷ is aryl or substituted aryl. In certain embodiments, R⁷ is heteroaryl or substituted heteroaryl. In certain embodiments, R⁷ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R⁷ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, L is a linker. Further description of the linker, L, is found in the disclosure herein. For example, in some embodiments, L is a linker comprising -(L¹)_(a)-(L²)_(b)-(L³)_(c)-(L⁴)_(d)-(L⁵)_(e)-, where L¹, L², L³, L⁴ and L⁵ are each a linker unit, and a, b, c, d and e are each independently 0 or 1, wherein the sum of a, b, c, d and e is 1 to 5, as described in more detail below. Other linkers are also possible, as shown described in more detail below.

In certain embodiments, W is a chemical entity, such as a drug or a detectable label. In certain embodiments, W is a drug. In certain embodiments, W is a detectable label.

In certain embodiments, the hydrazinyl-substituted heteroaryl compound is a compound of formula (IIa):

In certain embodiments, the substituents for formula (IIa) are the same as those described for formula (II) herein. For example, in certain embodiments, the substituents for formula (IIa), e.g., R², R⁴, R⁵, R⁶, R⁸, L and W, are as defined in formula (II) herein.

In certain embodiments, the hydrazinyl-substituted heteroaryl compound is a compound of formula (IIb):

In certain embodiments, the substituents for formula (IIb) are the same as those described for formula (II) herein. For example, in certain embodiments, the substituents for formula (IIb), e.g., R², R⁴, R⁵, R⁶, R⁷, R⁸, L and W, are as defined in formula (II) herein.

In certain embodiments, the hydrazinyl-substituted heteroaryl compound is a compound of formula (IIc):

In certain embodiments, the substituents for formula (IIc) are the same as those described for formula (II) herein. For example, in certain embodiments, the substituents for formula (IIc), e.g., R¹, R³, R⁵, R⁶, R⁹, L and W, are as defined in formula (II) herein.

In certain embodiments, the compound is of the following structure:

where L and W are as defined in formula (II) herein.

In certain embodiments, the compound is of the following structure:

where L and W are as defined in formula (II) herein.

In certain embodiments, the compound is of the following structure:

where L and W are as defined in formula (II) herein.

Linkers Useful for Conjugates and Compounds

The present disclosure provides linkers (L) useful for the conjugates and compounds described herein, such as conjugates and compounds of each of the formulae disclosed herein (e.g., formulae (I), (Ia), (Ib), (Ic), (II), (IIa), (IIb) and (IIc) as described herein). The linkers may be utilized to bind a coupling moiety to one or more moieties of interest and/or one or more polypeptides. In some embodiments, the linker binds a coupling moiety to either a polypeptide or a chemical entity. In some instances, the linker binds a coupling moiety to a chemical entity (e.g., a drug or a detectable label). The linker may be bound (e.g., covalently bound) to the coupling moiety (e.g., as described herein) at any convenient position. As described herein, the coupling moiety may be used to form a covalent attachment to a second moiety, such as a polypeptide (e.g., an antibody). Thus, the linker provided herein attaches the chemical entity (e.g., drug or detectable label) to the polypeptide (e.g., antibody). Stated another way, the chemical entity (e.g., drug or detectable label) is attached to the polypeptide (e.g., antibody) through the linker.

Any convenient linker may be utilized in the subject conjugates and compounds. In certain embodiments, L includes a group selected from alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl amino, alkylamide, substituted alkylamide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, L includes an alkyl or substituted alkyl group. In certain embodiments, L includes an alkenyl or substituted alkenyl group. In certain embodiments, L includes an alkynyl or substituted alkynyl group. In certain embodiments, L includes an alkoxy or substituted alkoxy group. In certain embodiments, L includes an amino or substituted amino group. In certain embodiments, L includes a carboxyl or carboxyl ester group. In certain embodiments, L includes an acyl amino group. In certain embodiments, L includes an alkylamide or substituted alkylamide group. In certain embodiments, L includes an aryl or substituted aryl group. In certain embodiments, L includes a heteroaryl or substituted heteroaryl group. In certain embodiments, L includes a cycloalkyl or substituted cycloalkyl group. In certain embodiments, L includes a heterocyclyl or substituted heterocyclyl group.

In certain embodiments, L includes a polymer. For example, the polymer may include a polyalkylene glycol and derivatives thereof, including polyethylene glycol, methoxypolyethylene glycol, polyethylene glycol homopolymers, polypropylene glycol homopolymers, copolymers of ethylene glycol with propylene glycol (e.g., where the homopolymers and copolymers are unsubstituted or substituted at one end with an alkyl group), polyvinyl alcohol, polyvinyl ethyl ethers, polyvinylpyrrolidone, combinations thereof, and the like. In certain embodiments, the polymer is a polyalkylene glycol. In certain embodiments, the polymer is a polyethylene glycol. Other linkers are also possible, as shown in the conjugates and compounds described in more detail below.

In some embodiments, L is a linker described by the formula -(L¹)_(a)-(L²)_(b)-(L³)_(c)-(L⁴)d-(L⁵)_(e)-, wherein L¹, L², L³, L⁴ and L⁵ are each independently a linker unit, and a, b, c, d and e are each independently 0 or 1, wherein the sum of a, b, c, d and e is 1 to 5.

In certain embodiments, the sum of a, b, c, d and e is 1. In certain embodiments, the sum of a, b, c, d and e is 2. In certain embodiments, the sum of a, b, c, d and e is 3. In certain embodiments, the sum of a, b, c, d and e is 4. In certain embodiments, the sum of a, b, c, d and e is 5. In certain embodiments, a, b, c, d and e are each 1. In certain embodiments, a, b, c and d are each 1, and e is 0. In certain embodiments, a, b and c are each 1, and d and e are each 0. In certain embodiments, a and b are each 1, and c, d and e are each 0. In certain embodiments, a is 1 and b, c, d and e are each O.

Any convenient linker units may be utilized in the subject linkers. Linker units of interest include, but are not limited to, units of polymers such as polyethylene glycols, polyethylenes and polyacrylates, amino acid residue(s), carbohydrate-based polymers or carbohydrate residues and derivatives thereof, polynucleotides, alkyl groups, aryl groups, heterocycle groups, cleavable linker groups, combinations thereof, and substituted versions thereof. In some embodiments, each of L¹, L², L³, L⁴ and L⁵ (if present) include one or more groups independently selected from a polyethylene glycol, a substituted polyethylene glycol, an amino acid residue, an alkyl, a substituted alkyl, an aryl, a substituted aryl, a diamine (e.g., a linking group that includes an alkylene diamine, such as ethylene diamine), a heterocyclic group, a substituted heterocyclic group, an acetal, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, and a cleavable moiety (e.g., a chemically cleavable moiety, an enzymatically cleavable moiety (such as, but not limited to, a protease cleavable moiety (e.g., a moiety comprising one or more amino acids), a glucuronidase cleavable moiety (e.g., a carbohydrate), a beta-lactamase cleavable moiety (e.g., a beta-lactam), etc.), a photocleavable moiety, and the like).

In some embodiments, L¹ (if present) comprises a polyethylene glycol, a substituted polyethylene glycol, an amino acid residue, an alkyl, a substituted alkyl, an aryl, a substituted aryl, a diamine, a heterocyclic group, a substituted heterocyclic group, an acetal, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, or a cleavable moiety. In some embodiments, L¹ includes a polyethylene glycol. In some embodiments, L¹ includes a substituted polyethylene glycol, such as, but not limited to a carboxy substituted polyethylene glycol (e.g., a polyethylene glycol comprising a carboxy group) or an ester substituted polyethylene glycol (e.g., a polyethylene glycol comprising an ester group, such as a methyl ester). In some embodiments, L¹ includes one or more amino acid residues. In some embodiments, L¹ includes an alkyl or a substituted alkyl. In some embodiments, L¹ includes an aryl or a substituted aryl. In some embodiments, L¹ includes a diamine (e.g., an alkylene diamine, such as ethylene diamine). In some embodiments, L¹ includes a heterocyclic group or a substituted heterocyclic group. In some embodiments, L¹ includes an acetal. In some embodiments, L¹ includes a disulfide. In some embodiments, L¹ includes a hydrazine. In some embodiments, L¹ includes a carbohydrate. In some embodiments, L¹ includes a beta-lactam. In some embodiments, L¹ includes an ester. In some embodiments, L¹ includes a cleavable moiety.

In some embodiments, L² (if present) comprises a polyethylene glycol, a substituted polyethylene glycol, an amino acid residue, an alkyl, a substituted alkyl, an aryl, a substituted aryl, a diamine, a heterocyclic group, a substituted heterocyclic group, an acetal, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, or a cleavable moiety. In some embodiments, L² includes a polyethylene glycol. In some embodiments, L² includes a substituted polyethylene glycol, such as, but not limited to a carboxy substituted polyethylene glycol (e.g., a polyethylene glycol comprising a carboxy group) or an ester substituted polyethylene glycol (e.g., a polyethylene glycol comprising an ester group, such as a methyl ester). In some embodiments, L² includes one or more amino acid residues. In some embodiments, L² includes an alkyl or a substituted alkyl. In some embodiments, L² includes an aryl or a substituted aryl. In some embodiments, L² includes a diamine (e.g., an alkylene diamine, such as ethylene diamine). In some embodiments, L² includes a heterocyclic group or a substituted heterocyclic group. In some embodiments, L² includes an acetal. In some embodiments, L² includes a disulfide. In some embodiments, L² includes a hydrazine. In some embodiments, L² includes a carbohydrate. In some embodiments, L² includes a beta-lactam. In some embodiments, L² includes an ester. In some embodiments, L² includes a cleavable moiety.

In some embodiments, L³ (if present) comprises a polyethylene glycol, a substituted polyethylene glycol, an amino acid residue, an alkyl, a substituted alkyl, an aryl, a substituted aryl, a diamine, a heterocyclic group, a substituted heterocyclic group, an acetal, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, or a cleavable moiety. In some embodiments, L³ includes a polyethylene glycol. In some embodiments, L³ includes a substituted polyethylene glycol, such as, but not limited to a carboxy substituted polyethylene glycol (e.g., a polyethylene glycol comprising a carboxy group) or an ester substituted polyethylene glycol (e.g., a polyethylene glycol comprising an ester group, such as a methyl ester). In some embodiments, L³ includes one or more amino acid residues. In some embodiments, L³ includes an alkyl or a substituted alkyl. In some embodiments, L³ includes an aryl or a substituted aryl. In some embodiments, L³ includes a diamine (e.g., an alkylene diamine, such as ethylene diamine). In some embodiments, L³ includes a heterocyclic group or a substituted heterocyclic group. In some embodiments, L³ includes an acetal. In some embodiments, L³ includes a disulfide. In some embodiments, L³ includes a hydrazine. In some embodiments, L³ includes a carbohydrate. In some embodiments, L³ includes a beta-lactam. In some embodiments, L³ includes an ester. In some embodiments, L³ includes a cleavable moiety.

In some embodiments, L⁴ (if present) comprises a polyethylene glycol, a substituted polyethylene glycol, an amino acid residue, an alkyl, a substituted alkyl, an aryl, a substituted aryl, a diamine, a heterocyclic group, a substituted heterocyclic group, an acetal, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, or a cleavable moiety. In some embodiments, L⁴ includes a polyethylene glycol. In some embodiments, L⁴ includes a substituted polyethylene glycol, such as, but not limited to a carboxy substituted polyethylene glycol (e.g., a polyethylene glycol comprising a carboxy group) or an ester substituted polyethylene glycol (e.g., a polyethylene glycol comprising an ester group, such as a methyl ester). In some embodiments, L⁴ includes one or more amino acid residues. In some embodiments, L⁴ includes an alkyl or a substituted alkyl. In some embodiments, L⁴ includes an aryl or a substituted aryl. In some embodiments, L⁴ includes a diamine (e.g., an alkylene diamine, such as ethylene diamine). In some embodiments, L⁴ includes a heterocyclic group or a substituted heterocyclic group. In some embodiments, L⁴ includes an acetal. In some embodiments, L¹ includes a disulfide. In some embodiments, L⁴ includes a hydrazine. In some embodiments, L⁴ includes a carbohydrate. In some embodiments, L⁴ includes a beta-lactam. In some embodiments, L⁴ includes an ester. In some embodiments, L⁴ includes a cleavable moiety.

In some embodiments, L⁵ (if present) comprises a polyethylene glycol, a substituted polyethylene glycol, an amino acid residue, an alkyl, a substituted alkyl, an aryl, a substituted aryl, a diamine, a heterocyclic group, a substituted heterocyclic group, an acetal, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, or a cleavable moiety. In some embodiments, L⁵ includes a polyethylene glycol. In some embodiments, L⁵ includes a substituted polyethylene glycol, such as, but not limited to a carboxy substituted polyethylene glycol (e.g., a polyethylene glycol comprising a carboxy group) or an ester substituted polyethylene glycol (e.g., a polyethylene glycol comprising an ester group, such as a methyl ester). In some embodiments, L⁵ includes one or more amino acid residues. In some embodiments, L⁵ includes an alkyl or a substituted alkyl. In some embodiments, L⁵ includes an aryl or a substituted aryl. In some embodiments, L⁵ includes a diamine (e.g., an alkylene diamine, such as ethylene diamine). In some embodiments, L⁵ includes a heterocyclic group or a substituted heterocyclic group. In some embodiments, L⁵ includes an acetal. In some embodiments, L⁵ includes a disulfide. In some embodiments, L⁵ includes a hydrazine. In some embodiments, L⁵ includes a carbohydrate. In some embodiments, L⁵ includes a beta-lactam. In some embodiments, L⁵ includes an ester. In some embodiments, L⁵ includes a cleavable moiety.

Any convenient cleavable moiety may be utilized as a cleavable linker unit in the subject conjugates and compounds. In certain embodiments, the cleavable moiety is a para-amino-benzyloxycarbonyl group (PABC), a meta-amino-benzyloxycarbonyl group (MABC), a para-amino-benzyloxy group (PABO), a meta-amino-benzyloxy group (MABO), para-aminobenzyl (PAB), an acetal group, a disulfide, a hydrazine, a protease-cleavable moiety (e.g., e.g., a sequence of amino acids, such as a Cat B cleavable moiety), a glucuronidase cleavable moiety (e.g., a carbohydrate), a beta-lactamase cleavable moiety (e.g., a beat-lactam), or an ester.

In some embodiments, L is a linker of the formula -(L1)_(a)-(L²)_(b)-(L³)_(c)-(L⁴)_(d)-(L⁵)_(e)-, where:

-(L¹)_(a)- is -(T¹-V¹)_(a)-;

-(L²)_(b)- is -(T²-V²)_(b)-;

-(L³)_(c)- is -(T³-V³)_(c)-;

-(L⁴)_(d)- is -(T⁴-V⁴)_(d)-; and

-(L⁵)_(e)- is -(T⁵-V⁵)_(e)-;

where T¹, T², T³, T⁴ and T⁵, if present, are each tether groups;

V¹, V², V³, V⁴ and V⁵, if present, are each independently a covalent bond or a linking functional group; and

a, b, c, d and e are each independently 0 or 1, where the sum of a, b, c, d and e is 1 to 5.

Regarding the tether groups, T¹, T², T³, T⁴ and T⁵, any convenient tether groups may be utilized in the subject linkers. In some embodiments, T¹, T², T³, T⁴ and T⁵ each comprise one or more groups independently selected from a (C₁-C₁₂)alkyl, a substituted (C₁-C₁₂)alkyl, an (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4-amino-piperidinyl (4AP), para-aminobenzyl (PAB), para-amino-benzyloxy (PABO), meta-amino-benzyloxy (MABO), para-amino-benzyloxycarbonyl (PABC), meta-amino-benzyloxycarbonyl (MABC), an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, a protease-cleavable moiety, a glucuronidase cleavable moiety, a beta-lactamase cleavable moiety, and an ester, where w is an integer from 1 to 20, n is an integer from 1 to 30, p is an integer from 1 to 20, and h is an integer from 1 to 12.

In some embodiments, T¹ includes one or more groups independently selected from a (C₁-C₁₂)alkyl, a substituted (C₁-C₁₂)alkyl, an (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4AP, PAB, PABO, MABO, PABC, MABC, an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, a protease-cleavable moiety, a glucuronidase cleavable moiety, a beta-lactamase cleavable moiety, and an ester. In some embodiments, T¹ includes a (C₁-C₁₂)alkyl or a substituted (C₁-C₁₂)alkyl, such as a (C₁-C₆)alkyl or a substituted (C₁-C₆)alkyl or a (C₁-C₃)alkyl or a substituted (C₁-C₃)alkyl. In some embodiments, T¹ includes an (EDA)_(w). In some embodiments, T¹ includes a (PEG)_(n). In some embodiments, T¹ includes an (AA)_(p). In some embodiments, T¹ includes a —(CR¹³OH)_(h)—. In some embodiments, T¹ includes a 4AP. In some embodiments, T¹ includes a PAB. In some embodiments, T¹ includes a PABO. In some embodiments, T¹ includes a MABO. In some embodiments, T¹ includes a PABC. In some embodiments, T¹ includes a MABC. In some embodiments, T¹ includes an acetal group. In some embodiments, T¹ includes a disulfide. In some embodiments, T¹ includes a hydrazine. In some embodiments, T¹ includes a carbohydrate. In some embodiments, T¹ includes a beta-lactam. In some embodiments, T¹ includes a protease-cleavable moiety (e.g., a sequence of one or more amino acids). In some embodiments, T¹ includes a glucuronidase cleavable moiety (e.g., a carbohydrate). In some embodiments, T¹ includes a beta-lactamase cleavable moiety (e.g., a beta-lactam). In some embodiments, T¹ includes and an ester.

In some embodiments, T² includes one or more groups independently selected from a (C₁-C₁₂)alkyl, a substituted (C₁-C₁₂)alkyl, an (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4AP, PAB, PABO, MABO, PABC, MABC, an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, a protease-cleavable moiety, a glucuronidase cleavable moiety, a beta-lactamase cleavable moiety, and an ester. In some embodiments, T² includes a (C₁-C₁₂)alkyl or a substituted (C₁-C₁₂)alkyl, such as a (C₁-C₆)alkyl or a substituted (C₁-C₆)alkyl or a (C₁-C₃)alkyl or a substituted (C₁-C₃)alkyl. In some embodiments, T² includes an (EDA)_(w). In some embodiments, T² includes a (PEG)_(n). In some embodiments, T² includes an (AA)_(p). In some embodiments, T² includes a —(CR¹³OH)_(h)—. In some embodiments, T² includes a 4AP. In some embodiments, T² includes a PAB. In some embodiments, T² includes a PABO. In some embodiments, T² includes a MABO. In some embodiments, T² includes a PABC. In some embodiments, T² includes a MABC. In some embodiments, T² includes an acetal group. In some embodiments, T² includes a disulfide. In some embodiments, T² includes a hydrazine. In some embodiments, T² includes a carbohydrate. In some embodiments, T² includes a beta-lactam. In some embodiments, T² includes a protease-cleavable moiety (e.g., a sequence of one or more amino acids). In some embodiments, T² includes a glucuronidase cleavable moiety (e.g., a carbohydrate). In some embodiments, T² includes a beta-lactamase cleavable moiety (e.g., a beta-lactam). In some embodiments, T² includes and an ester.

In some embodiments, T³ includes one or more groups independently selected from a (C₁-C₁₂)alkyl, a substituted (C₁-C₁₂)alkyl, an (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4AP, PAB, PABO, MABO, PABC, MABC, an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, a protease-cleavable moiety, a glucuronidase cleavable moiety, a beta-lactamase cleavable moiety, and an ester. In some embodiments, T³ includes a (C₁-C₁₂)alkyl or a substituted (C₁-C₁₂)alkyl, such as a (C₁-C₆)alkyl or a substituted (C₁-C₆)alkyl or a (C₁-C₃)alkyl or a substituted (C₁-C₃)alkyl. In some embodiments, T³ includes an (EDA)_(w). In some embodiments, T³ includes a (PEG)_(n). In some embodiments, T³ includes an (AA)_(p). In some embodiments, T³ includes a —(CR¹³OH)_(h)—. In some embodiments, T³ includes a 4AP. In some embodiments, T³ includes a PAB. In some embodiments, T³ includes a PABO. In some embodiments, T³ includes a MABO. In some embodiments, T³ includes a PABC. In some embodiments, T³ includes a MABC. In some embodiments, T³ includes an acetal group. In some embodiments, T³ includes a disulfide. In some embodiments, T³ includes a hydrazine. In some embodiments, T³ includes a carbohydrate. In some embodiments, T³ includes a beta-lactam. In some embodiments, T³ includes a protease-cleavable moiety (e.g., a sequence of one or more amino acids). In some embodiments, T³ includes a glucuronidase cleavable moiety (e.g., a carbohydrate). In some embodiments, T³ includes a beta-lactamase cleavable moiety (e.g., a beta-lactam). In some embodiments, T³ includes and an ester.

In some embodiments, T⁴ includes one or more groups independently selected from a (C₁-C₁₂)alkyl, a substituted (C₁-C₁₂)alkyl, an (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4AP, PAB, PABO, MABO, PABC, MABC, an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, a protease-cleavable moiety, a glucuronidase cleavable moiety, a beta-lactamase cleavable moiety, and an ester. In some embodiments, T⁴ includes a (C₁-C₁₂)alkyl or a substituted (C₁-C₁₂)alkyl, such as a (C₁-C₆)alkyl or a substituted (C₁-C₆)alkyl or a (C₁-C₃)alkyl or a substituted (C₁-C₃)alkyl. In some embodiments, T⁴ includes an (EDA)_(w). In some embodiments, T⁴ includes a (PEG)_(n). In some embodiments, T⁴ includes an (AA)_(p). In some embodiments, T⁴ includes a —(CR¹³OH)_(h)—. In some embodiments, T⁴ includes a 4AP. In some embodiments, T⁴ includes a PAB. In some embodiments, T⁴ includes a PABO. In some embodiments, T⁴ includes a MABO. In some embodiments, T⁴ includes a PABC. In some embodiments, T⁴ includes a MABC. In some embodiments, T⁴ includes an acetal group. In some embodiments, T⁴ includes a disulfide. In some embodiments, T⁴ includes a hydrazine. In some embodiments, T⁴ includes a carbohydrate. In some embodiments, T⁴ includes a beta-lactam. In some embodiments, T⁴ includes a protease-cleavable moiety (e.g., a sequence of one or more amino acids). In some embodiments, T⁴ includes a glucuronidase cleavable moiety (e.g., a carbohydrate). In some embodiments, T⁴ includes a beta-lactamase cleavable moiety (e.g., a beta-lactam). In some embodiments, T⁴ includes and an ester.

In some embodiments, T⁵ includes one or more groups independently selected from a (C₁-C₁₂)alkyl, a substituted (C₁-C₁₂)alkyl, an (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4AP, PAB, PABO, MABO, PABC, MABC, an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, a protease-cleavable moiety, a glucuronidase cleavable moiety, a beta-lactamase cleavable moiety, and an ester. In some embodiments, T⁵ includes a (C₁-C₁₂)alkyl or a substituted (C₁-C₁₂)alkyl, such as a (C₁-C₆)alkyl or a substituted (C₁-C₆)alkyl or a (C₁-C₃)alkyl or a substituted (C₁-C₃)alkyl. In some embodiments, T⁵ includes an (EDA)_(w). In some embodiments, T⁵ includes a (PEG)_(n). In some embodiments, T⁵ includes an (AA)_(p). In some embodiments, T⁵ includes a —(CR¹³OH)_(h)—. In some embodiments, T⁵ includes a 4AP. In some embodiments, T⁵ includes a PAB. In some embodiments, T⁵ includes a PABO. In some embodiments, T⁵ includes a MABO. In some embodiments, T⁵ includes a PABC. In some embodiments, T⁵ includes a MABC. In some embodiments, T⁵ includes an acetal group. In some embodiments, T⁵ includes a disulfide. In some embodiments, T⁵ includes a hydrazine. In some embodiments, T⁵ includes a carbohydrate. In some embodiments, T⁵ includes a beta-lactam. In some embodiments, T⁵ includes a protease-cleavable moiety (e.g., a sequence of one or more amino acids). In some embodiments, T⁵ includes a glucuronidase cleavable moiety (e.g., a carbohydrate). In some embodiments, T⁵ includes a beta-lactamase cleavable moiety (e.g., a beta-lactam). In some embodiments, T⁵ includes and an ester.

In some instances, T¹, T², T³, T⁴ and T⁵ may each include a combination of one or more tether groups described above, such as, but not limited to combinations such as (AA)_(p)-MABO, (AA)_(p)-MABC, (AA)_(p)-PABO, (AA)_(p)-PABC, MABO-(AA)_(p), MABC-(AA)_(p), PABO-(AA)_(p), PABC-(AA)_(p), (AA)_(p)-MABO-(AA)_(p), (AA)_(p)-MABC-(AA)_(p), (AA)_(p)-PABO-(AA)_(p), and (AA)_(p)-PABC-(AA)_(p).

For example, T¹ may include a tether group, such as (AA)_(p)-MABO. In some embodiments, T¹ includes (AA)_(p)-MABC. In some embodiments, T¹ includes (AA)_(p)-PABO. In some embodiments, T¹ includes (AA)_(p)-PABC. In some embodiments, T¹ includes MABO-(AA)_(p). In some embodiments, T¹ includes MABC-(AA)_(p). In some embodiments, T¹ includes PABO-(AA)_(p). In some embodiments, T¹ includes PABC-(AA)_(p). In some embodiments, T¹ includes (AA)_(p)-MABO-(AA)_(p). In some embodiments, T¹ includes (AA)_(p)-MABC-(AA)_(p). In some embodiments, T¹ includes (AA)_(p)-PABO-(AA)_(p). In some embodiments, T¹ includes (AA)_(p)-PABC-(AA)_(p).

In some embodiments, T² includes a tether group, such as (AA)_(p)-MABO. In some embodiments, T² includes (AA)_(p)-MABC. In some embodiments, T² includes (AA)_(p)-PABO. In some embodiments, T² includes (AA)_(p)-PABC. In some embodiments, T² includes MABO-(AA)_(p). In some embodiments, T² includes MABC-(AA)_(p). In some embodiments, T² includes PABO-(AA)_(p). In some embodiments, T² includes PABC-(AA)_(p). In some embodiments, T² includes (AA)_(p)-MABO-(AA)_(p). In some embodiments, T² includes (AA)_(p)-MABC-(AA)_(p). In some embodiments, T² includes (AA)_(p)-PABO-(AA)_(p). In some embodiments, T² includes (AA)_(p)-PABC-(AA)_(p).

In some embodiments, T³ includes a tether group, such as (AA)_(p)-MABO. In some embodiments, T³ includes (AA)_(p)-MABC. In some embodiments, T³ includes (AA)_(p)-PABO. In some embodiments, T³ includes (AA)_(p)-PABC. In some embodiments, T³ includes MABO-(AA)_(p). In some embodiments, T³ includes MABC-(AA)_(p). In some embodiments, T³ includes PABO-(AA)_(p). In some embodiments, T³ includes PABC-(AA)_(p). In some embodiments, T³ includes (AA)_(p)-MABO-(AA)_(p). In some embodiments, T³ includes (AA)_(p)-MABC-(AA)_(p). In some embodiments, T³ includes (AA)_(p)-PABO-(AA)_(p). In some embodiments, T³ includes (AA)_(p)-PABC-(AA)_(p).

In some embodiments, T⁴ includes a tether group, such as (AA)_(p)-MABO. In some embodiments, T⁴ includes (AA)_(p)-MABC. In some embodiments, T⁴ includes (AA)_(p)-PABO. In some embodiments, T⁴ includes (AA)_(p)-PABC. In some embodiments, T⁴ includes MABO-(AA)_(p). In some embodiments, T⁴ includes MABC-(AA)_(p). In some embodiments, T⁴ includes PABO-(AA)_(p). In some embodiments, T⁴ includes PABC-(AA)_(p). In some embodiments, T⁴ includes (AA)_(p)-MABO-(AA)_(p). In some embodiments, T⁴ includes (AA)_(p)-MABC-(AA)_(p). In some embodiments, T⁴ includes (AA)_(p)-PABO-(AA)_(p). In some embodiments, T⁴ includes (AA)_(p)-PABC-(AA)_(p).

In some embodiments, T⁵ includes a tether group, such as (AA)_(p)-MABO. In some embodiments, T⁵ includes (AA)_(p)-MABC. In some embodiments, T⁵ includes (AA)_(p)-PABO. In some embodiments, T⁵ includes (AA)_(p)-PABC. In some embodiments, T⁵ includes MABO-(AA)_(p). In some embodiments, T⁵ includes MABC-(AA)_(p). In some embodiments, T⁵ includes PABO-(AA)_(p). In some embodiments, T⁵ includes PABC-(AA)_(p). In some embodiments, T⁵ includes (AA)_(p)-MABO-(AA)_(p). In some embodiments, T⁵ includes (AA)_(p)-MABC-(AA)_(p). In some embodiments, T⁵ includes (AA)_(p)-PABO-(AA)_(p). In some embodiments, T⁵ includes (AA)_(p)-PABC-(AA)_(p).

As described above, the tether group may include an ethylene diamine (EDA) moiety, e.g., an EDA containing tether group. In certain embodiments, (EDA)_(w) includes one or more EDA moieties, such as where w is an integer from 1 to 50, such as from 1 to 40, from 1 to 30, from 1 to 20, from 1 to 12 or from 1 to 6, such as 1, 2, 3, 4, 5 or 6. The ethylene diamine (EDA) moieties may optionally be substituted at one or more convenient positions with any convenient substituents, e.g., with an alkyl, a substituted alkyl, an acyl, a substituted acyl, an aryl or a substituted aryl. In certain embodiments, the EDA moiety is described by the structure:

where q is an integer from 1 to 6 and r is 0 or 1 and each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or any two adjacent R¹² groups may be cyclically linked to form a piperazinyl ring.

In certain embodiments, q is 1, 2, 3, 4, 5 or 6. In certain embodiments, q is 1 and r is 0. In certain embodiments, q is 1 and r is 1. In certain embodiments, q is 2 and r is 0. In certain embodiments, q is 2 and r is 1.

In certain embodiments, each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or wherein two adjacent R¹² groups are cyclically linked to form a piperazinyl ring. In certain embodiments, each R¹² is independently selected from hydrogen, an alkyl, a substituted alkyl, an aryl and a substituted aryl. In certain embodiments, any two adjacent R¹² groups of the EDA may be cyclically linked, e.g., to form a piperazinyl ring. In certain embodiments, q is 1 and the two adjacent R¹² groups are a cyclically linked alkeylene group, thus forming a piperazinyl ring. In certain embodiments, q is 1 and the adjacent R¹² groups are selected from hydrogen, an alkyl (e.g., methyl) and a substituted alkyl (e.g., lower alkyl-OH, such as ethyl-OH or propyl-OH).

As described above, the tether group may include a 4-amino-piperidinyl (4AP) moiety (also referred to as a piperidin-4-amino (P4A) moiety). The 4AP moiety may optionally be substituted at one or more convenient positions with any convenient substituents, e.g., with an alkyl, a substituted alkyl, a polyethylene glycol, a substituted polyethylene glycol, an acyl, a substituted acyl, an aryl or a substituted aryl. In certain embodiments, the 4AP moiety is described by the structure:

where R¹⁴ is selected from hydrogen, alkyl, substituted alkyl, a polyethylene glycol moiety (e.g., a polyethylene glycol or a substituted polyethylene glycol), alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments, each R¹⁴ is independently selected from hydrogen, alkyl, substituted alkyl, polyethylene glycol, substituted polyethylene glycol, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments, R¹⁴ is a polyethylene glycol. In certain embodiments, R¹⁴ is a substituted polyethylene glycol, such as, but not limited to a carboxy substituted polyethylene glycol (e.g., a polyethylene glycol that includes a carboxy substituent, such as a carboxy substituent at one end of the polyethylene glycol) or an ester substituted polyethylene glycol (e.g., a polyethylene glycol that includes an ester substituent, such as a methyl ester, such as an ester substituent as one end of the polyethylene glycol). The polyethylene glycol or substituted polyethylene glycol substituent at the R¹⁴ position can include one or more polyethylene glycol subunits, such as 1 to 50, or 1 to 40, or 1 to 30, or 1 to 20, or 1 to 12, or 1 to 10, or 1 to 6, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 polyethylene glycol subunits. In some instances, the polyethylene glycol or substituted polyethylene glycol substituent at the R¹⁴ position includes 3 polyethylene glycol subunits. In some instances, the polyethylene glycol or substituted polyethylene glycol substituent at the R¹⁴ position includes 6 polyethylene glycol subunits. In some instances, the polyethylene glycol or substituted polyethylene glycol substituent at the R¹⁴ position includes 8 polyethylene glycol subunits. In some instances, the polyethylene glycol or substituted polyethylene glycol substituent at the R¹⁴ position includes 10 polyethylene glycol subunits. In some instances, the polyethylene glycol or substituted polyethylene glycol substituent at the R¹⁴ position includes 12 polyethylene glycol subunits.

As described above, the tether group may include a (PEG)_(n) group, where PEG is a polyethylene glycol or a substituted polyethylene glycol. In certain embodiments, (PEG)_(n) is described by the structure:

where n is an integer from 1 to 50, such as from 1 to 40, from 1 to 30, from 1 to 20, from 1 to 12 or from 1 to 6, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20. In some instances, n is 3. In some instances, n is 6. In some instances, n is 8. In some instances, n is 10. In some instances, n is 12.

In certain embodiments, a tether group includes (AA)_(p), where AA is an amino acid residue. Any convenient amino acids may be utilized. Amino acids of interest include but are not limited to, L- and D-amino acids, naturally occurring amino acids such as any of the 20 primary alpha-amino acids and beta-alanine, non-naturally occurring amino acids (e.g., amino acid analogs), such as a non-naturally occurring alpha-amino acid or a non-naturally occurring beta-amino acid, etc. In certain embodiments, p is 1. In certain embodiments, p is an integer from 1 to 50, such as from 1 to 40, from 1 to 30, from 1 to 20, from 1 to 12 or from 1 to 6, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20.

In certain embodiments, a tether group includes a moiety described by the formula —(CR¹³OH)_(h)—, where h is 0 or n is an integer from 1 to 50, such as from 1 to 40, from 1 to 30, from 1 to 20, from 1 to 12 or from 1 to 6, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12.

In certain embodiments, R¹³ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments, each R¹³ is independently selected from hydrogen, alkyl, substituted alkyl, aryl, and substituted aryl.

In certain embodiments, R¹³ is hydrogen. In certain embodiments, R¹³ is alkyl or substituted alkyl. In certain embodiments, R¹³ is alkenyl or substituted alkenyl. In certain embodiments, R¹³ is alkynyl or substituted alkynyl. In certain embodiments, R¹³ is alkoxy or substituted alkoxy. In certain embodiments, R¹³ is amino or substituted amino. In certain embodiments, R¹³ is carboxyl or carboxyl ester. In certain embodiments, R¹³ is acyl or acyloxy. In certain embodiments, R¹³ is acyl amino or amino acyl. In certain embodiments, R¹³ is alkylamide or substituted alkylamide. In certain embodiments, R¹³ is sulfonyl. In certain embodiments, R¹³ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R¹³ is aryl or substituted aryl. In certain embodiments, R¹³ is heteroaryl or substituted heteroaryl. In certain embodiments, R¹³ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R¹³ is heterocyclyl or substituted heterocyclyl. In certain embodiments, R¹³ is selected from hydrogen, an alkyl, a substituted alkyl, an aryl, and a substituted aryl.

In some embodiments, a tether group includes a para-aminobenzyl (PAB) group described by the following structure:

In some embodiments, a tether group includes a para-aminobenzyloxy (PABO) group described by the following structure:

In some embodiments, a tether group includes a para-amino-benzyloxycarbonyl (PABC) group described by the following structure:

In some embodiments of the PAB, PABO, and PABC tether groups shown above, R¹⁵ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments of PAB, PABO, and PABC, R¹⁵ is selected from hydrogen, an alkyl, a substituted alkyl, an aryl and a substituted aryl. In certain embodiments of PAB, PABO, and PABC, R¹⁵ is selected from hydrogen, an alkyl (e.g., methyl) and a substituted alkyl (e.g., lower alkyl-OH, such as ethyl-OH or propyl-OH). In some embodiments, any of the PAB, PABO, and PABC tether groups shown above may be further substituted with one or more convenient aryl and/or alkyl substituents. In certain embodiments of PAB, PABO, and PABC, R¹⁵ is hydrogen. The divalent PAB, PABO, and PABC tether groups may be covalently bound to adjacent moieties via any convenient chemistries.

In some embodiments, a tether group includes a meta-amino-benzyloxy (MABO) group described by the following structure:

In some embodiments, a tether group includes a meta-amino-benzyloxycarbonyl (MABC) group described by the following structure:

In some embodiments of the MABO and MABC tether groups shown above, R¹⁵ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In certain embodiments of MABO and MABC, R¹⁵ is selected from hydrogen, an alkyl, a substituted alkyl, an aryl and a substituted aryl. In certain embodiments of MABO and MABC, R¹⁵ is selected from hydrogen, an alkyl (e.g., methyl) and a substituted alkyl (e.g., lower alkyl-OH, such as ethyl-OH or propyl-OH). In some embodiments, any of the MABO and MABC tether groups shown above may be further substituted with one or more convenient aryl and/or alkyl substituents. In certain embodiments of MABO and MABC, R¹⁵ is hydrogen. The divalent MABO and MABC tether groups may be covalently bound to adjacent moieties via any convenient chemistries.

In some embodiments of the MABO and MABC tether groups shown above, R¹⁶ is selected from hydrogen, halogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In some embodiments of the MABO and MABC tether groups shown above, R¹⁶ is a carbohydrate or carbohydrate derivative, such as, but not limited to, α-D-glucose or β-D-glucose.

Regarding V¹, V², V³, V⁴ and V⁵, any convenient linking functional groups may be utilized in the subject linkers. Linking functional groups of interest include, but are not limited to, amino, carbonyl, amido, oxycarbonyl, carboxy, sulfonyl, sulfoxide, sulfonylamino, aminosulfonyl, thio, oxy, phospho, phosphoramidate, thiophosphoraidate, and the like. In some embodiments, V¹, V², V³, V⁴ and V⁵ are each independently selected from the group consisting of a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—.

In some embodiments, V¹ is selected from a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—. In some embodiments, V¹ is a covalent bond. In some embodiments, V¹ is —CO—. In some embodiments, V¹ is —NR¹¹—. In some embodiments, V¹ is —CONR¹¹—. In some embodiments, V¹ is —NR¹¹CO—. In some embodiments, V¹ is —C(O)O—. In some embodiments, V¹ is —OC(O)—. In some embodiments, V¹ is —O—. In some embodiments, V¹ is —S—. In some embodiments, V¹ is —S(O)—. In some embodiments, V¹ is —SO₂—. In some embodiments, V¹ is —SO₂NR¹¹—. In some embodiments, V¹ is —NR¹¹SO₂—. In some embodiments, V¹ is —P(O)OH—.

In some embodiments, V² is selected from a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—. In some embodiments, V² is a covalent bond. In some embodiments, V² is —CO—. In some embodiments, V² is —NR¹¹—. In some embodiments, V² is —CONR¹¹—. In some embodiments, V² is —NR¹¹CO—. In some embodiments, V² is —C(O)O—. In some embodiments, V² is —OC(O)—. In some embodiments, V² is —O—. In some embodiments, V² is —S—. In some embodiments, V² is —S(O)—. In some embodiments, V² is —SO₂—. In some embodiments, V² is —SO₂NR¹¹—. In some embodiments, V² is —NR¹¹SO₂—. In some embodiments, V² is —P(O)OH—.

In some embodiments, V³ is selected from a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—. In some embodiments, V³ is a covalent bond. In some embodiments, V³ is —CO—. In some embodiments, V³ is —NR¹¹—. In some embodiments, V³ is —CONR¹¹—. In some embodiments, V³ is —NR¹¹CO—. In some embodiments, V³ is —C(O)O—. In some embodiments, V³ is —OC(O)—. In some embodiments, V³ is —O—. In some embodiments, V³ is —S—. In some embodiments, V³ is —S(O)—. In some embodiments, V³ is —SO₂—. In some embodiments, V³ is —SO₂NR¹¹—. In some embodiments, V³ is —NR¹¹SO₂—. In some embodiments, V³ is —P(O)OH—.

In some embodiments, V⁴ is selected from a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—. In some embodiments, V⁴ is a covalent bond. In some embodiments, V⁴ is —CO—. In some embodiments, V⁴ is —NR¹¹—. In some embodiments, V⁴ is —CONR¹¹—. In some embodiments, V⁴ is —NR¹¹CO—. In some embodiments, V⁴ is —C(O)O—. In some embodiments, V⁴ is —OC(O)—. In some embodiments, V⁴ is —O—. In some embodiments, V⁴ is —S—. In some embodiments, V⁴ is —S(O)—. In some embodiments, V⁴ is —SO₂—. In some embodiments, V⁴ is —SO₂NR¹¹—. In some embodiments, V⁴ is —NR¹¹SO₂—. In some embodiments, V⁴ is —P(O)OH—.

In some embodiments, V⁵ is selected from a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO₂NR¹¹—, —NR¹¹SO₂— and —P(O)OH—. In some embodiments, V⁵ is a covalent bond. In some embodiments, V⁵ is —CO—. In some embodiments, V⁵ is —NR¹¹—. In some embodiments, V⁵ is —CONR¹¹—. In some embodiments, V⁵ is —NR¹¹CO—. In some embodiments, V⁵ is —C(O)O—. In some embodiments, V⁵ is —OC(O)—. In some embodiments, V⁵ is —O—. In some embodiments, V⁵ is —S—. In some embodiments, V⁵ is —S(O)—. In some embodiments, V⁵ is —SO₂—. In some embodiments, V⁵ is —SO₂NR¹¹—. In some embodiments, V⁵ is —NR¹¹SO₂—. In some embodiments, V⁵ is —P(O)OH—.

In some embodiments of V¹, V², V³, V⁴ and V⁵, each R¹¹ is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl. In some embodiments, each R¹¹ is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments, R¹¹ is hydrogen. In certain embodiments, R¹¹ is alkyl or substituted alkyl. In certain embodiments, R¹¹ is alkenyl or substituted alkenyl. In certain embodiments, R¹¹ is alkynyl or substituted alkynyl. In certain embodiments, R¹¹ is alkoxy or substituted alkoxy. In certain embodiments, R¹¹ is carboxyl or carboxyl ester. In certain embodiments, R¹¹ is acyl or acyloxy. In certain embodiments, R¹¹ is acyl amino or amino acyl. In certain embodiments, R¹¹ is alkylamide or substituted alkylamide. In certain embodiments, R¹¹ is sulfonyl. In certain embodiments, R¹¹ is thioalkoxy or substituted thioalkoxy. In certain embodiments, R¹¹ is aryl or substituted aryl. In certain embodiments, R¹¹ is heteroaryl or substituted heteroaryl. In certain embodiments, R¹¹ is cycloalkyl or substituted cycloalkyl. In certain embodiments, R¹¹ is heterocyclyl or substituted heterocyclyl.

In certain embodiments, T¹, T², T³, T⁴ and T⁵ and V¹, V², V³, V⁴ and V⁵ are selected from the following table, e.g., one row of the following table:

T¹ V¹ T² V² T³ V³ T⁴ V⁴ T⁵ V⁵ (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —NR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (EDA)_(w) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —SO₂— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (CR¹³OH)_(h) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— — — (C₁-C₁₂)alkyl —CONR¹¹— substituted —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl (C₁-C₁₂)alkyl —SO2— (C₁-C₁₂)alkyl —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl — (CR¹³OH)_(h) —CONR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —P(O)OH— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— (C₁-C₁₂)alkyl —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- —NR¹¹— (C₁-C₁₂)alkyl —CO— PABC (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- — — — PABC- (AA)_(p) (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC- — (AA)_(p) (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABO — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABO — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC —NR¹¹— — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC- — (AA)_(p) (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — —.

As described above, in some embodiments, L is a linker comprising -(L1)_(a)-(L²)_(b)-(L³)_(c)-(L⁴)_(d)-(L⁵)_(e), where -(L¹)_(a)- is -(T¹-V¹)_(a)-; -(L²)_(b)- is -(T²-V²)_(b)-; -(L³)_(c)- is -(T³-V³)_(c)-; -(L⁴)_(d)- is -(T⁴-V⁴)_(d)-; and -(L⁵)_(e)- is -(T⁵-V⁵)_(e)-.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (EDA)_(w), V² is —CO—, T³ is (CR¹³OH)_(h), V³ is —CONR¹¹—, T⁴ is (C₁-C₁₂)alkyl, V⁴ is —CO—, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is absent, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —NR¹¹—, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V3 is —NR¹¹—, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V is —CO—, T² is (EDA)_(w), V² is —CO—, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (C₁-C₁₂)alkyl, V² is —NR¹¹—, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (EDA)_(w), V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V is —CO—, T² is (EDA)_(w), V² is absent, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (EDA)_(w), V² is —CO—, T³ is (CR¹³OH)_(h), V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —SO₂— T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (CR¹³OH)_(h), V² is —CO—, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (EDA)_(w), V² is —CO—, T³ is (CR¹³OH)_(h), V³ is —CONR¹¹—, T⁴ is (PEG)_(n), V⁴ is —CO—, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is substituted (C₁-C₁₂)alkyl, V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —SO₂—, T² is (C₁-C₁₂)alkyl, V² is —CO—, T³ is absent, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (C₁-C₁₂)alkyl, V² is absent, T³ is (CR¹³OH)_(h), V³ is —CONR¹¹—, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is (AA)_(p), V⁴ is —NR¹¹—, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —P(O)OH—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (EDA)_(w), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (EDA)_(w), V² is —CO—, T³ is (CR¹³OH)_(h), V³ is —CONR¹¹—, T⁴ is (C₁-C₁₂)alkyl, V⁴ is —CO—, T⁵ is (AA)_(p), and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (C₁-C₁₂)alkyl, V² is —NR¹¹—, T³ is absent, V³ is —CO—, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (C₁-C₁₂)alkyl, V² is —NR¹¹—, T³ is absent, V³ is —CO—, T⁴ is (C₁-C₁₂)alkyl, V⁴ is —NR¹¹—, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (EDA)_(w), V² is —CO—, T³ is (CR¹³OH)_(h), V³ is —CONR¹¹—, T⁴ is (PEG)_(n), V⁴ is —CO—, T⁵ is (AA)_(p), and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is 4AP, V² is —CO—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is 4AP, V² is —CO—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is MABO, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is MABC, V⁴ is absent, T⁵ is (AA)_(p), and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is (AA)_(p)-PABC, V⁴ is —NR¹¹—, T⁵ is (C₁-C₁₂)alkyl, and V⁵ is —CO—.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is PABC, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is PABO, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is (AA)_(p)-PABC-(AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is PABC-(AA)_(p), and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is PABO, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is 4AP, V² is —CO—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is PABO, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is MABO, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is PABC, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is MABC, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is PABC, V⁴ is absent, T⁵ is (AA)_(p), and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is MABO, V³ is absent, T⁴ is absent, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is PABO, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is (AA)_(p), V² is —NR¹¹—, T³ is (PEG)_(n), V³ is —CO—, T⁴ is PABC, V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is MABC, V³ is absent, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CONR¹¹—, T² is (PEG)_(n), V² is —CO—, T³ is (AA)_(p), V³ is absent, T⁴ is PABC, V⁴ is —NR¹¹—, T⁵ is absent, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is 4AP, V² is —CO—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is PABC, and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is 4AP, V² is —CO—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is PABC-(AA)_(p), and V⁵ is absent.

In certain embodiments, T¹ is (C₁-C₁₂)alkyl, V¹ is —CO—, T² is 4AP, V² is —CO—, T³ is (C₁-C₁₂)alkyl, V³ is —CO—, T⁴ is (AA)_(p), V⁴ is absent, T⁵ is absent, and V⁵ is absent.

In some embodiments, the linker is described by one of the following structures:

wherein:

each f is independently 0 or an integer from 1 to 12;

each n is independently 0 or an integer from 1 to 30;

each y is independently 0 or an integer from 1 to 20;

each h is independently 0 or an integer from 1 to 12;

each p is independently 0 or an integer from 1 to 20;

each R is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; and

each R′ is independently selected from hydrogen, a sidechain group of an amino acid, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments of the linker structures depicted above, each f is independently 0 or an integer from 1 to 12; each n is independently 0 or an integer from 1 to 30; each y is independently 0 or an integer from 1 to 20; each h is independently 0 or an integer from 1 to 12; each p is independently 0 or an integer from 1 to 20.

In certain embodiments of the linker structures depicted above, each R is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments of the linker structures depicted above, each R′ is independently hydrogen, a sidechain of an amino acid, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments of the linker structures depicted above, each f is independently 0, 1, 2, 3, 4, 5 or 6; each n is independently 0, 1, 2, 3, 4, 5 or 6; each y is independently 0, 1, 2, 3, 4, 5 or 6; each h is independently 0, 1, 2, 3, 4, 5 or 6; and each p is independently 0, 1, 2, 3, 4, 5 or 6. In certain embodiments of the linker structures depicted above, each R is independently H, methyl or —(CH₂)_(m)—OH where m is 1, 2, 3, 4, 5 or 6 (e.g., 2).

In certain embodiments, the linker includes a cleavable group, e.g., as described in the following structures:

wherein:

each f is independently 0 or an integer from 1 to 12;

each n is independently 0 or an integer from 1 to 30;

each y is independently 0 or an integer from 1 to 20;

each h is independently 0 or an integer from 1 to 12;

each p is independently 0 or an integer from 1 to 20;

each R is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; and

each R′ is independently selected from hydrogen, a sidechain group of an amino acid, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments of the linker structures depicted above, each f is independently 0 or an integer from 1 to 12; each n is independently 0 or an integer from 1 to 30; each y is independently 0 or an integer from 1 to 20; each h is independently 0 or an integer from 1 to 12; each p is independently 0 or an integer from 1 to 20.

In certain embodiments of the linker structures depicted above, each R is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments of the linker structures depicted above, each R′ is independently hydrogen, a sidechain of an amino acid, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.

In certain embodiments of the linker structures depicted above, each f is independently 0, 1, 2, 3, 4, 5 or 6; each n is independently 0, 1, 2, 3, 4, 5 or 6; each y is independently 0, 1, 2, 3, 4, 5 or 6; each h is independently 0, 1, 2, 3, 4, 5 or 6; and each p is independently 0, 1, 2, 3, 4, 5 or 6. In certain embodiments of the linker structures depicted above, each R is independently H, methyl or —(CH₂)_(m)—OH where m is 1, 2, 3, 4, 5 or 6 (e.g., 2).

Any of the chemical entities, linkers and coupling moieties set forth in the structures above may be adapted for use in the subject compounds and conjugates.

Additional disclosure related to conjugates, linkers and methods for producing conjugates is found in U.S. Application Publication No. 2015/0157736, filed Nov. 26, 2014, the disclosure of which is incorporated herein by reference.

Target Polypeptides

Any of a wide variety of polypeptides can be modified to be conjugated to a moiety of interest as described above. Polypeptides suitable for modification include both proteins having a naturally-occurring amino acid sequence, fragments of naturally-occurring polypeptides, and non-naturally occurring polypeptides and fragments thereof.

The following are examples of classes and types of polypeptides which are of interest for modification using the compounds and methods described herein to produce the polypeptide conjugates described herein.

Therapeutic Polypeptides

In certain embodiments, the methods of producing a conjugate are applied to modification of polypeptides that may provide for a therapeutic benefit, such as those polypeptides for which attachment to a moiety can provide for one or more of, for example, an increase in serum half-life, a decrease in an adverse immune response, additional or alternate biological activity or functionality, and the like, or other benefit or reduction of an adverse side effect. Where the therapeutic polypeptide is an antigen for a vaccine, modification can provide for an enhanced immunogenicity of the polypeptide.

Examples of classes of therapeutic proteins include those that are cytokines, chemokines, growth factors, hormones, antibodies, and antigens. Further examples include, but are not limited to, the following: erythropoietin (EPO, e.g., native EPO or synthetic EPO (see, e.g., US 2003/0191291), such as, but not limited to, e.g., PROCRIT®, EPREX®, or EPOGEN® (epoetin-α), ARANESP® (darbepoietin-α), NEORECORMON®, EPOGIN® (epoetin-β), and the like); a growth hormone (e.g., a somatotropin, e.g., GENOTROPIN®, NUTROPIN®, NORDITROPIN®, SAIZEN®, SEROSTIM®, HUMATROPE®, etc.); human growth hormone (hGH); bovine growth hormone (bGH); follicle stimulating hormone (FSH); interferon (e.g., IFN-γ, IFN-α, IFN-β, IFN-ω; IFN-τ, consensus interferon, and the like); insulin (e.g., Novolin, Humulin, Humalog, Lantus, Ultralente, etc.), insulin-like growth factor (e.g., IGF-I, IGF-II); blood factors (e.g., Factor X, tissue plasminogen activator (TPA), and the like, such as, but not limited to, e.g., ACTIVASE® (alteplase) tissue plasminogen activator, NOVOSEVEN® (recombinant human factor VIIa), Factor VIIa, Factor VIII (e.g., KOGENATE®), Factor IX, β-globin, hemoglobin, and the like); colony stimulating factors (e.g., granulocyte-CSF (G-CSF, e.g., NEUPOGEN® (filgrastim)), macrophage-CSF (M-CSF), granulocyte-macrophage-CSF (GM-CSF), Neulasta (pegfilgrastim), granulocyte-monocyte colony stimulating factor, megakaryocyte colony stimulating factor, and the like), transforming growth factors (e.g., TGF-beta, TGF-alpha); interleukins (e.g., IL-1, IL-2 (e.g., Proleukin®), IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-12, and the like); a growth factor (e.g., epidermal growth factor (EGF), platelet-derived growth factor (PDGF, e.g., REGRANEX® (beclapermin)), fibroblast growth factors (FGFs, e.g., aFGF, bFGF, such as FIBLAST® (trafermin)), glial cell line-derived growth factor (GDNF), nerve growth factor (NGF), stem cell factor (e.g., STEMGEN® (ancestim)), keratinocyte growth factor, a hepatocyte growth factor, and the like); a soluble receptor (e.g., a TNF-α-binding soluble receptor such as ENBREL® (etanercept), a soluble VEGF receptor, a soluble interleukin receptor, a soluble γ/δ T cell receptor, and the like); an enzyme (e.g., α-glucosidase, CERAZYME® (imiglucarase, β-glucocerebrosidase, CEREDASE® (alglucerase); an enzyme activator (e.g., tissue plasminogen activator); a chemokine (e.g., IP-10, Mig, Groα/IL-8, regulated and normal T cell expressed and secreted (RANTES), MIP-1α, MIP-1β, MCP-1, PF-4, and the like); an angiogenic agent (e.g., vascular endothelial growth factor (VEGF); an anti-angiogenic agent (e.g., a soluble VEGF receptor); a protein vaccine; a neuroactive peptide such as bradykinin, cholecystokinin, gastin, secretin, oxytocin, gonadotropin-releasing hormone, beta-endorphin, enkephalin, substance P, somatostatin, galanin, growth hormone-releasing hormone, bombesin, warfarin, dynorphin, neurotensin, motilin, thyrotropin, neuropeptide Y, luteinizing hormone, calcitonin, insulin, glucagon, vasopressin, angiotensin II, thyrotropin-releasing hormone, vasoactive intestinal peptide, a sleep peptide, etc.; other proteins such as a thrombolytic agent, an atrial natriuretic peptide, bone morphogenic protein, thrombopoietin, relaxin, glial fibrillary acidic protein, follicle stimulating hormone, a human alpha-1 antitrypsin, a leukemia inhibitory factor, a transforming growth factor, a tissue factor, an insulin-like growth factor, a luteinizing hormone, a follicle stimulating hormone, a macrophage activating factor, tumor necrosis factor, a neutrophil chemotactic factor, a nerve growth factor, a tissue inhibitor of metalloproteinases; a vasoactive intestinal peptide, angiogenin, angiotropin, fibrin; hirudin; a leukemia inhibitory factor; an IL-1 receptor antagonist (e.g., Kineret® (anakinra)); and the like. It will be readily appreciated that native forms of the above therapeutic proteins are also of interest as target polypeptides in the present disclosure.

Further examples include antibodies, e.g., polyclonal antibodies, monoclonal antibodies, humanized antibodies, antigen-binding fragments (e.g., F(ab)′, Fab, Fv), single chain antibodies, and the like (e.g., RITUXAN® (rituximab); REMICADE® (infliximab); HERCEPTIN® (trastuzumab); HUMIRA™ (adalimumab); XOLAIR® (omalizumab); BEXXAR® (tositumomab); RAPTIVA™ (efalizumab); ERBITUX™ (cetuximab); and the like). In some instances, antibodies include antibodies that specifically bind to a tumor antigen, an immune cell antigen (e.g., CD4, CD8, and the like), an antigen of a microorganism, particularly a pathogenic microorganism (e.g., a bacterial, viral, fungal, or parasitic antigen), and the like.

In some instances, the methods, conjugates and compounds described herein can be applied to provide for a moiety (e.g., a water-soluble polymer) at a native or engineered site of glycosylation, such as found in hyperglycosylated forms of a therapeutic protein.

The biological activity of a modified target polypeptide can be assayed according to methods known in the art. Modified polypeptides that retain at least one desired pharmacologic activity of the corresponding parent protein are of interest.

Immunogenic Compositions

The methods, conjugates and compounds disclosed herein also find application in production of components of immunogenic compositions (e.g., therapeutic vaccines). For example, the compounds can be used to facilitate attachment of moieties that increase serum half-life of a polypeptide antigen, that increase immunogenicity of the polypeptide, or that link a non-amino acid antigen to a polypeptide carrier. In this regard, the compounds can be used to facilitate modification of microbial antigens (e.g., a bacterial, viral, fungal, or parasitic antigen), tumor antigens, and other antigens which are of interest for administration to a subject to elicit an immune response in the subject. Also of interest is modification of antigens that are useful in eliciting antibodies which can be useful as research tools.

Further examples of polypeptides of interest for modification using the compounds disclosed herein include those that are of interest for detection or functional monitoring in an assay (e.g., as a research tool, in a drug screening assay, and the like). Examples of polypeptides of this type include receptors (e.g., G-protein coupled receptors (GPCRs, including orphan GPCRs)), receptor ligands (including naturally-occurring and synthetic), protein channels (e.g., ion channels (e.g., potassium channels, calcium channels, sodium channels, and the like), and other polypeptides. In some embodiments, modification of cell surface-associated polypeptides, such as transmembrane polypeptides) is of interest, for example where such modification is accomplished while the polypeptide is present in a membrane. Methods for modification of a polypeptide under physiological conditions are described further below.

Methods of Polypeptide Production

In general, the polypeptides described herein may be expressed in prokaryotes or eukaryotes in accordance with conventional ways, depending upon the purpose for expression. Thus, the present invention further provides a host cell, e.g., a genetically modified host cell that comprises a nucleic acid encoding a polypeptide.

Host cells for production (including large scale production) of an unconjugated or modified polypeptide suitable to form a conjugate as described herein can be selected from any of a variety of available host cells. Examples of host cells include those of a prokaryotic or eukaryotic unicellular organism, such as bacteria (e.g., Escherichia coli strains, Bacillus spp. (e.g., B. subtilis), and the like) yeast or fungi (e.g., S. cerevisiae, Pichia spp., and the like), and other such host cells can be used. Examples of host cells originally derived from a higher organism such as insects, vertebrates, including mammals, (e.g., CHO, HEK, and the like), may be used as the expression host cells.

Suitable mammalian cell lines include, but are not limited to, HeLa cells (e.g., American Type Culture Collection (ATCC) No. CCL-2), CHO cells (e.g., ATCC Nos. CRL9618 and CRL9096), CHO DG44 cells (Urlaub (1983) Cell 33:405), CHO-K1 cells (ATCC CCL-61), 293 cells (e.g., ATCC No. CRL-1573), Vero cells, NIH 3T3 cells (e.g., ATCC No. CRL-1658), Huh-7 cells, BHK cells (e.g., ATCC No. CCL10), PC12 cells (ATCC No. CRL1721), COS cells, COS-7 cells (ATCC No. CRL1651), RAT1 cells, mouse L cells (ATCC No. CCLI.3), human embryonic kidney (HEK) cells (ATCC No. CRL1573), HLHepG2 cells, and the like.

Specific expression systems of interest include bacterial, yeast, insect cell and mammalian cell derived expression systems.

The expressed polypeptide can be recovered by any appropriate means known in the art. Further, any convenient protein purification procedures may be employed, where suitable protein purification methodologies are described in Guide to Protein Purification, (Deuthser ed.) (Academic Press, 1990). For example, a lysate may prepared from a cell comprising the expression vector expressing the desired polypeptide, and purified using high performance liquid chromatography (HPLC), exclusion chromatography, gel electrophoresis, affinity chromatography, and the like.

Methods for Modification of a Polypeptide

In certain embodiments, the polypeptide may be conjugated to a moiety of interest without first modifying the polypeptide. For instance, the polypeptide may include one or more reactive groups suitable for conjugation to the moiety of interest (e.g., a moiety comprising a coupling moiety, such as a hydrazinyl-substituted heteroaryl compound or derivative thereof as described herein). In other embodiments, the polypeptide may be modified before conjugation to the moiety of interest. Modification of the polypeptide may produce a modified polypeptide that contains one or more reactive groups suitable for conjugation to the moiety of interest.

In some cases, the polypeptide may be modified at one or more amino acid residues to provide one or more reactive groups suitable for conjugation to the moiety of interest (e.g., a moiety comprising a coupling moiety, such as a hydrazinyl-substituted heteroaryl compound or derivative thereof as described herein). For example, carbonyls introduced into a polypeptide can be selectively reacted with α-nucleophiles, such as aminooxy- and hydrazide-bearing compounds. Chemistries selective for carbonyl functional groups on a protein with enhanced kinetics, site selectivity and conjugate stability may result in improved bioconjugates and provide access to new products and therapeutic targets as disclosed herein.

In certain embodiments, the polypeptide may be modified to include a reactive aldehyde group (e.g., a reactive aldehyde). A reactive aldehyde may be included in an “aldehyde tag” or “ald-tag”, which is meant to refer to an amino acid sequence derived from a sulfatase motif that has been converted by action of a formylglycine generating enzyme (FGE) to contain a 2-formylglycine residue (referred to herein as “fGly”). The fGly residue generated by an FGE is also referred to in the literature as a “formylglycine”. Stated differently, the term “aldehyde tag” is used herein to refer to an amino acid sequence comprising a “converted” sulfatase motif (i.e., a sulfatase motif in which the cysteine or the serine residue has been converted to fGly by action of an FGE, e.g., L(fGly)TPSR). A converted sulfatase motif may be derived from an amino acid sequence comprising an “unconverted” sulfatase motif (i.e., a sulfatase motif in which the cysteine or serine residues has not been converted to fGly by an FGE, but is capable of being converted, e.g., an unconverted sulfatase motif with the sequence: L(C/S)TPSR). By “conversion” as used in the context of action of a formylglycine generating enzyme (FGE) on a sulfatase motif refers to biochemical modification of a cysteine or serine residue in a sulfatase motif to a formylglycine (fGly) residue (e.g., Cys to fGly, or Ser to fGly). Additional aspects of aldehyde tags and uses thereof in site-specific protein modification are described in U.S. Pat. Nos. 8,846,866; 8,729,232; 8,349,910; 8,097,701; and 7,985,783, the disclosures of each of which are incorporated herein by reference.

Conversion of a polypeptide to include fGly can be accomplished by cell-based (in vivo) or cell-free methods (in vitro). Similarly, modification of a polypeptide to produce a polypeptide suitable for conjugation (e.g., modification to produce a polypeptide containing a reactive group suitable for conjugation) can be accomplished by cell-based (in vivo) or cell-free methods (in vitro).

Alternatively, isolated, unmodified polypeptides can be isolated following recombinant production in a host cell lacking a suitable enzyme or by synthetic production. The isolated polypeptide may then be contacted with a suitable enzyme (e.g., a formylglycine generating enzyme; FGE) under conditions to provide for the desired modification of the polypeptide to include fGly. The polypeptide can be unfolded by methods known in the art (e.g., using heat, adjustment of pH, chaotropic agents, (e.g., urea, and the like), organic solvents (e.g., hydrocarbons: octane, benzene, chloroform), etc.) and the denatured protein contacted with a suitable enzyme. The modified polypeptide can then be refolded under suitable conditions.

Additional aspects of suitable enzymes (e.g., FGEs) and uses thereof in site-specific protein modification are described in U.S. application Ser. No. 14/975,403, filed Dec. 18, 2015, the disclosure of which is incorporated herein by reference.

In some cases, the modified polypeptide containing the fGly residue may be conjugated to the moiety of interest by reaction of the fGly with a compound as described herein (e.g., a compound containing a coupling moiety, such as a hydrazinyl-substituted heteroaryl compound or derivative thereof as described herein). For example, an fGly-containing polypeptide may be isolated from a production source (e.g., recombinant host cell production, synthetic production), and contacted with a reactive partner-containing drug or other moiety (e.g., detectable label) under conditions suitable to provide for conjugation of the drug or other moiety to the polypeptide. For example, the reactive partner-containing drug or other moiety may include a reactive moiety (e.g., a hydrazinyl-substituted heteroaryl compound or derivative thereof as described herein). The hydrazinyl-substituted heteroaryl-containing drug or other moiety may be reacted with the polypeptide to produce a polypeptide conjugate as described herein.

To conjugate a hydrazinyl-substituted heteroaryl-modified drug to an aldehyde-tagged antibody as described herein, the following general protocol may be used. A desired amount of the hydrazinyl-substituted heteroaryl-modified drug may be reacted with an appropriate aldehyde-tagged antibody (e.g., an aldehyde-tagged antibody (dimer) with one fGly (e.g., reactive aldehyde group) per chain). After the reaction is complete, unreacted drug can be removed using diafiltration. Mono- and di-conjugated species may then be purified away from unconjugated material, for example using hydrophobic interaction chromatography (HIC). The enriched sample may be put into a buffer (e.g., a final formulation buffer, such as phosphate buffered saline (PBS)), for example by using diafiltration. The final sample may be analyzed using hydrophobic interaction chromatography (HIC) to determine the drug to antibody ratio (DAR) and/or may be analyzed using size exclusion chromatography (SEC) to determine the level of aggregation.

Polypeptide Conjugates

The polypeptides can be subjected to conjugation to provide for attachment of a wide variety of moieties. Examples of moieties of interest include, but are not limited to, a drug, a detectable label, a small molecule, a water-soluble polymer, a peptide, and the like (also referred to a “payload” or “cargo” herein). Thus, the present disclosure provides a polypeptide conjugate as described above.

The moiety of interest is provided as a component of a reactive partner for reaction with a residue of a polypeptide. In certain embodiments, the methods of polypeptide conjugation are compatible with reaction conditions suitable for the polypeptide. For example, the reaction conditions may include a reaction mixture that includes water. In some cases, the reaction mixture may have a pH compatible with the polypeptide, such as, but not limited to, a pH of 4 to 11, or a pH of 5 to 10, or a pH of 6 to 9, or a pH of 6 to 8. In certain instances, the reaction mixture has a pH of 7. In some embodiments, the reaction conditions are performed at a temperature compatible with the polypeptide. For example, the reaction conditions may be at a temperature of 20° C. to 45° C., such as 25° C. to 40° C., or 30° C. to 40° C., or 35° C. to 40° C. In some cases, the reaction conditions are at room temperature (e.g., 25° C.). In some instances, the reaction conditions are at a temperature of 37° C.

Provided the present disclosure, the ordinarily skilled artisan can readily adapt any of a variety of moieties to provide a reactive partner for conjugation to a polypeptide as contemplated herein. The ordinarily skilled artisan will appreciate that factors such as pH and steric hindrance (i.e., the accessibility of the modified amino acid residue to reaction with a reactive partner of interest) are of importance. Modifying reaction conditions to provide for optimal conjugation conditions is well within the skill of the ordinary artisan, and is routine in the art. Where conjugation is conducted with a polypeptide present in or on a living cell, the conditions are selected so as to be physiologically compatible. For example, the pH can be dropped temporarily for a time sufficient to allow for the reaction to occur but within a period tolerated by the cell (e.g., from about 30 min to 1 hour). Physiological conditions for conducting modification of polypeptides on a cell surface can be similar to those used in a ketone-azide reaction in modification of cells bearing cell-surface azides (see, e.g., U.S. Pat. No. 6,570,040).

In certain embodiments, the present disclosure provides a polypeptide conjugate, where the polypeptide is an antibody. As such, embodiments include an antibody conjugated to a moiety of interest, where an antibody conjugated to a moiety of interest is referred to as an “antibody conjugate.” An Ig polypeptide generally includes at least an Ig heavy chain constant region or an Ig light chain constant region, and can further include an Ig variable region (e.g., a V_(L) region and/or a V_(H) region). Ig heavy chain constant regions include Ig constant regions of any heavy chain isotype, non-naturally occurring Ig heavy chain constant regions (including consensus Ig heavy chain constant regions). An Ig constant region can be modified to be conjugated to a moiety of interest, where the moiety of interest is present in or adjacent a solvent-accessible loop region of the Ig constant region.

In some cases, an antibody conjugate of the present disclosure can include: 1) Ig heavy chain constant region conjugated to one or more moieties of interest, and an Ig light chain constant region conjugated to one or more moieties of interest; 2) an Ig heavy chain constant region conjugated to one or more moieties of interest, and an Ig light chain constant region that is not conjugated to a moiety of interest; or 3) an Ig heavy chain constant region that is not conjugated to a moiety of interest, and an Ig light chain constant region conjugated to one or more moieties of interest. A subject antibody conjugate can also include variable VH and/or VL domains. As described above, the one or more moieties of interest may be conjugated to the Ig heavy chain constant region or the Ig light chain constant region at a single amino acid residue (e.g., one or two moieties of interest conjugated to a single amino acid residue), or conjugated to the Ig heavy chain constant region and/or the Ig light chain constant region at two or more different amino acid residues.

An antibody conjugate of the present disclosure can include, as the conjugated moiety, any of a variety of compounds, as described herein, e.g., a drug (e.g., a peptide drug, a small molecule drug, and the like), a water-soluble polymer, a detectable label, a synthetic peptide, etc.

An antibody conjugate can have any of a variety of antigen-binding specificities, as described above, including, e.g., an antigen present on a cancer cell; an antigen present on an autoimmune cell; an antigen present on a pathogenic microorganism; an antigen present on a virus-infected cell (e.g., a human immunodeficiency virus-infected cell), e.g., CD4 or gp120; an antigen present on a diseased cell; and the like. For example, an antibody conjugate can bind an antigen, as noted above, where the antigen is present on the surface of the cell. An antibody conjugate of the present disclosure can bind antigen with a suitable binding affinity, e.g., from 5×10⁻⁶ M to 10⁻⁷ M, from 10⁻⁷ M to 5×10⁻⁷ M, from 5×10⁻⁷ M to 10⁻⁸ M, from 10⁻⁸ M to 5×10⁻⁸ M, from 5×10⁻⁸ M to 10⁻⁹ M, or a binding affinity greater than 10⁻⁹ M.

As non-limiting examples, a subject antibody conjugate can bind an antigen present on a cancer cell (e.g., a tumor-specific antigen; an antigen that is over-expressed on a cancer cell; etc.), and the conjugated moiety can be a cytotoxic compound (e.g., a cytotoxic small molecule, a cytotoxic synthetic peptide, etc.). For example, a subject antibody conjugate can be specific for an antigen on a cancer cell, where the conjugated moiety is a cytotoxic compound (e.g., a cytotoxic small molecule, a cytotoxic synthetic peptide, etc.).

As further non-limiting examples, a subject antibody conjugate can bind an antigen present on a cell infected with a virus (e.g., where the antigen is encoded by the virus; where the antigen is expressed on a cell type that is infected by a virus; etc.), and the conjugated moiety can be a viral fusion inhibitor. For example, a subject antibody conjugate can bind an antigen present on a cell infected with a virus, and the conjugated moiety can be a viral fusion inhibitor.

Embodiments of the present disclosure also include polypeptide conjugates where the polypeptide is a carrier protein. For example, carrier proteins can be covalently and site-specifically bound to drug to provide a drug-containing scaffold. A carrier protein can be site-specifically conjugated to a covalently bound molecule of interest, such as a drug (e.g., a peptide, a small molecule drug, and the like), detectable label, etc. In certain embodiments, drug-scaffold conjugates can provide for enhanced serum half-life of the drug.

In general a “carrier protein” is a protein that is biologically inert, is susceptible to modification as disclosed herein, and which can provide for solvent-accessible presentation of the moiety of interest conjugated to the carrier protein through a modified amino acid residue in the carrier protein (e.g., through an oxime or hydrazone bond within the converted sulfatase motif of an aldehyde tagged carrier protein) in a physiological environment. “Biologically inert” is meant to indicate the carrier protein exhibits clinically insignificant or no detectable biological activity when administered to the appropriate subject, such as when administered to a human subject. Thus, carrier proteins are biologically inert in that they, for example, are of low immunogenicity, do not exhibit significant or detectable targeting properties (e.g., do not exhibit significant or detectable activity in binding to a specific receptor), and exhibit little or no detectable biological activity that may interfere with activity of the moiety (e.g., drug or detectable label) conjugated to the aldehyde-tagged carrier protein. By “low immunogenicity” is meant that the carrier protein elicits little or no detectable immune response upon administration to a subject, such as a mammalian subject, e.g., a human subject. Carrier proteins can be provided in monomeric or multimeric (e.g., dimeric) forms.

Carrier proteins having a three-dimensional structure when folded that provides for multiple different solvent-accessible sites that are amenable to modification (and thus conjugation to a moiety of interest) are of interest. In general, carrier proteins of interest are those that are of a size and three-dimensional folded structure so as to provide for presentation of the conjugated moiety of interest on solvent accessible surfaces in a manner that is sufficiently spatially separated so as to provide for activity and bioavailability of the conjugated moiety or moieties of interest. The carrier protein may be selected according to a variety of factors including, but not limited to, the moiety (e.g., drug or detectable label) to be conjugated to the carrier protein.

Accordingly, any of a wide variety of polypeptides can be suitable for use as carrier proteins for use in the carrier protein conjugates of the present disclosure. Such carrier proteins can include those having a naturally-occurring amino acid sequence, fragments of naturally-occurring polypeptides, and non-naturally occurring polypeptides and fragments thereof.

Examples of carrier proteins include, but are not limited to, albumin and fragments thereof (e.g., human serum albumin, bovine serum albumin, and the like), transferrin and fragments thereof (e.g. human transferrin), and Fc fragments having reduced binding to a mammalian Fc receptor, particularly a human Fc receptor (e.g., a modified Fc fragment of an antibody (e.g., IgG), such as a mammalian antibody, e.g., a human antibody). Examples of modified Fc fragments having reduced Fc receptor binding are exemplified by the Fc fragments of Herceptin (trastuzumab) and Rituxan (Rituximab), which contain point mutations that provide for reduced Fc receptor binding (see, e.g., Clynes et al., Nature Medicine (2000), 6, 443-446). Alternatively or in addition, the isotype of the Fc fragment can be selected according to a desired level of Fc receptor binding (e.g., use of an Fc fragment of an IgG4 isotype human heavy chain constant region rather than from IgG1 or IgG3. (see, e.g., Fridman FASEB J 1991 September; 5 (12): 2684-90). In general, carrier proteins can be at least about 4 kDa (e.g., about 50 amino acid residues in length), usually at least about 25 kDa, and can be larger in size (e.g., transferrin has a molecular weight of 90kDa while Fc fragments can have molecular weights of 30kDa to 50kDa).

The conjugates described herein can be used for a variety of applications including, but not limited to, visualization using fluorescence or epitope labeling (e.g., electron microscopy using gold particles equipped with reactive groups for conjugation to the compounds and conjugates described herein); protein immobilization (e.g., protein microarray production); protein dynamics and localization studies and applications; and conjugation of proteins with a moiety of interest (e.g., moieties that improve a parent protein's half-life (e.g., poly(ethylene glycol)), targeting moieties (e.g., to enhance delivery to a site of action), and biologically active moieties (e.g., a therapeutic moiety).

The polypeptide conjugate may include a polypeptide conjugated to a moiety or moieties that provide for one or more of a wide variety of functions or features. In general, examples of moieties include, but are not limited to, the following: detectable labels (e.g., fluorescent labels); light-activated dynamic moieties (e.g., azobenzene mediated pore closing, azobenzene mediated structural changes, photodecaging recognition motifs); water soluble polymers (e.g., PEGylation); purification tags (e.g., to facilitate isolation by affinity chromatography (e.g., attachment of a FLAG epitope); membrane localization domains (e.g., lipids or glycophosphatidylinositol (GPI)-type anchors); immobilization tags (e.g., to facilitate attachment of the polypeptide to a surface, including selective attachment); drugs (e.g., to facilitate drug targeting, e.g., through attachment of the drug to an antibody); targeted delivery moieties, (e.g., ligands for binding to a target receptor (e.g., to facilitate viral attachment, attachment of a targeting protein present on a liposome, etc.)), and the like.

Specific, non-limiting examples are provided below.

Drugs for Conjugation to a Polypeptide

Any of a number of drugs are suitable for use, or can be modified to be rendered suitable for use, as a reactive partner to conjugate to a polypeptide. Examples of drugs include small molecule drugs and peptide drugs. Thus, the present disclosure provides drug-polypeptide conjugates.

“Small molecule drug” as used herein refers to a compound, e.g., an organic compound, which exhibits a pharmaceutical activity of interest and which is generally of a molecular weight of 800 Da or less, or 2000 Da or less, but can encompass molecules of up to 5kDa and can be as large as 10 kDa. A small inorganic molecule refers to a molecule containing no carbon atoms, while a small organic molecule refers to a compound containing at least one carbon atom.

“Peptide drug” as used herein refers to amino-acid containing polymeric compounds, and is meant to encompass naturally-occurring and non-naturally-occurring peptides, oligopeptides, cyclic peptides, polypeptides, and proteins, as well as peptide mimetics. The peptide drugs may be obtained by chemical synthesis or be produced from a genetically encoded source (e.g., recombinant source). Peptide drugs can range in molecular weight, and can be from 200 Da to 10 kDa or greater in molecular weight.

In some cases, the drug is a cancer chemotherapeutic agent. For example, where the polypeptide is an antibody (or fragment thereof) that has specificity for a tumor cell, the antibody can be modified as described herein to include a modified amino acid, which can be subsequently conjugated to a cancer chemotherapeutic agent. Cancer chemotherapeutic agents include non-peptidic (i.e., non-proteinaceous) compounds that reduce proliferation of cancer cells, and encompass cytotoxic agents and cytostatic agents. Non-limiting examples of chemotherapeutic agents include alkylating agents, nitrosoureas, antimetabolites, antitumor antibiotics, plant (vinca) alkaloids, and steroid hormones. Peptidic compounds can also be used.

Suitable cancer chemotherapeutic agents include dolastatin and active analogs and derivatives thereof; and auristatin and active analogs and derivatives thereof (e.g., Monomethyl auristatin D (MMAD), monomethyl auristatin E (MMAE), monomethyl auristatin F (MMAF), and the like). See, e.g., WO 96/33212, WO 96/14856, and U.S. Pat. No. 6,323,315. For example, dolastatin 10 or auristatin PE can be included in an antibody-drug conjugate of the present disclosure. Suitable cancer chemotherapeutic agents also include maytansinoids and active analogs and derivatives thereof (see, e.g., EP 1391213; and Liu et al (1996) Proc. Natl. Acad. Sci. USA 93:8618-8623); duocarmycins and active analogs and derivatives thereof (e.g., including the synthetic analogues, KW-2189 and CB 1-TM1); and benzodiazepines and active analogs and derivatives thereof (e.g., pyrrolobenzodiazepine (PBD).

Agents that act to reduce cellular proliferation are known in the art and widely used. Such agents include alkylating agents, such as nitrogen mustards, nitrosoureas, ethylenimine derivatives, alkyl sulfonates, and triazenes, including, but not limited to, mechlorethamine, cyclophosphamide (Cytoxan™), melphalan (L-sarcolysin), carmustine (BCNU), lomustine (CCNU), semustine (methyl-CCNU), streptozocin, chlorozotocin, uracil mustard, chlormethine, ifosfamide, chlorambucil, pipobroman, triethylenemelamine, triethylenethiophosphoramine, busulfan, dacarbazine, and temozolomide.

Antimetabolite agents include folic acid analogs, pyrimidine analogs, purine analogs, and adenosine deaminase inhibitors, including, but not limited to, cytarabine (CYTOSAR-U), cytosine arabinoside, fluorouracil (5-FU), floxuridine (FudR), 6-thioguanine, 6-mercaptopurine (6-MP), pentostatin, 5-fluorouracil (5-FU), methotrexate, 10-propargyl-5,8-dideazafolate (PDDF, CB3717), 5,8-dideazatetrahydrofolic acid (DDATHF), leucovorin, fludarabine phosphate, pentostatine, and gemcitabine.

Suitable natural products and their derivatives, (e.g., vinca alkaloids, antitumor antibiotics, enzymes, lymphokines, and epipodophyllotoxins), include, but are not limited to, Ara-C, paclitaxel (Taxol®), docetaxel (Taxotere®), deoxycoformycin, mitomycin-C, L-asparaginase, azathioprine; brequinar; alkaloids, e.g. vincristine, vinblastine, vinorelbine, vindesine, etc.; podophyllotoxins, e.g. etoposide, teniposide, etc.; antibiotics, e.g. anthracycline, daunorubicin hydrochloride (daunomycin, rubidomycin, cerubidine), idarubicin, doxorubicin, epirubicin and morpholino derivatives, etc.; phenoxizone biscyclopeptides, e.g. dactinomycin; basic glycopeptides, e.g. bleomycin; anthraquinone glycosides, e.g. plicamycin (mithramycin); anthracenediones, e.g. mitoxantrone; azirinopyrrolo indolediones, e.g. mitomycin; macrocyclic immunosuppressants, e.g. cyclosporine, FK-506 (tacrolimus, prograf), rapamycin, etc.; and the like.

Other anti-proliferative cytotoxic agents are navelbene, CPT-11, anastrazole, letrazole, capecitabine, reloxafine, cyclophosphamide, ifosamide, and droloxafine.

Microtubule affecting agents that have antiproliferative activity are also suitable for use and include, but are not limited to, allocolchicine (NSC 406042), Halichondrin B (NSC 609395), colchicine (NSC 757), colchicine derivatives (e.g., NSC 33410), dolstatin 10 (NSC 376128), maytansine (NSC 153858), rhizoxin (NSC 332598), paclitaxel (Taxol®), Taxol® derivatives, docetaxel (Taxotere®), thiocolchicine (NSC 361792), trityl cysterin, vinblastine sulfate, vincristine sulfate, natural and synthetic epothilones including but not limited to, eopthilone A, epothilone B, discodermolide; estramustine, nocodazole, and the like.

Hormone modulators and steroids (including synthetic analogs) that are suitable for use include, but are not limited to, adrenocorticosteroids, e.g. prednisone, dexamethasone, etc.; estrogens and pregestins, e.g. hydroxyprogesterone caproate, medroxyprogesterone acetate, megestrol acetate, estradiol, clomiphene, tamoxifen; etc.; and adrenocortical suppressants, e.g. aminoglutethimide; 17α-ethinylestradiol; diethylstilbestrol, testosterone, fluoxymesterone, dromostanolone propionate, testolactone, methylprednisolone, methyl-testosterone, prednisolone, triamcinolone, chlorotrianisene, hydroxyprogesterone, aminoglutethimide, estramustine, medroxyprogesterone acetate, leuprolide, Flutamide (Drogenil), Toremifene (Fareston), and Zoladex®. Estrogens stimulate proliferation and differentiation; therefore compounds that bind to the estrogen receptor are used to block this activity. Corticosteroids may inhibit T cell proliferation.

Other suitable chemotherapeutic agents include metal complexes, e.g. cisplatin (cis-DDP), carboplatin, etc.; ureas, e.g. hydroxyurea; and hydrazines, e.g. N-methylhydrazine; epidophyllotoxin; a topoisomerase inhibitor; procarbazine; mitoxantrone; leucovorin; tegafur; etc. Other anti-proliferative agents of interest include immunosuppressants, e.g. mycophenolic acid, thalidomide, desoxyspergualin, azasporine, leflunomide, mizoribine, azaspirane (SKF 105685); Iressa® (ZD 1839, 4-(3-chloro-4-fluorophenylamino)-7-methoxy-6-(3-(4-morpholinyl)propoxy)quinazoline); etc.

Taxanes are suitable for use. “Taxanes” include paclitaxel, as well as any active taxane derivative or pro-drug. “Paclitaxel” (which should be understood herein to include analogues, formulations, and derivatives such as, for example, docetaxel, TAXOL™, TAXOTERE™ (a formulation of docetaxel), 10-desacetyl analogs of paclitaxel and 3′N-desbenzoyl-3′N-t-butoxycarbonyl analogs of paclitaxel) may be readily prepared utilizing techniques known to those skilled in the art (see also WO 94/07882, WO 94/07881, WO 94/07880, WO 94/07876, WO 93/23555, WO 93/10076; U.S. Pat. Nos. 5,294,637; 5,283,253; 5,279,949; 5,274,137; 5,202,448; 5,200,534; 5,229,529; and EP 590,267), or obtained from a variety of commercial sources, including for example, Sigma Chemical Co., St. Louis, Mo. (T7402 from Taxus brevifolia; or T-1912 from Taxus yannanensis).

Paclitaxel should be understood to refer to not only the common chemically available form of paclitaxel, but analogs and derivatives (e.g., Taxotere™ docetaxel, as noted above) and paclitaxel conjugates (e.g., paclitaxel-PEG, paclitaxel-dextran, or paclitaxel-xylose).

Also included within the term “taxane” are a variety of known derivatives, including both hydrophilic derivatives, and hydrophobic derivatives. Taxane derivatives include, but not limited to, galactose and mannose derivatives described in International Patent Application No. WO 99/18113; piperazino and other derivatives described in WO 99/14209; taxane derivatives described in WO 99/09021, WO 98/22451, and U.S. Pat. No. 5,869,680; 6-thio derivatives described in WO 98/28288; sulfenamide derivatives described in U.S. Pat. No. 5,821,263; and taxol derivative described in U.S. Pat. No. 5,415,869. It further includes prodrugs of paclitaxel including, but not limited to, those described in WO 98/58927; WO 98/13059; and U.S. Pat. No. 5,824,701.

Biological response modifiers suitable for use include, but are not limited to, (1) inhibitors of tyrosine kinase (RTK) activity; (2) inhibitors of serine/threonine kinase activity; (3) tumor-associated antigen antagonists, such as antibodies that bind specifically to a tumor antigen; (4) apoptosis receptor agonists; (5) interleukin-2; (6) IFN-α; (7) IFN-γ; (8) colony-stimulating factors; and (9) inhibitors of angiogenesis.

Examples of drugs include small molecule drugs, such as a cancer chemotherapeutic agent. For example, where the polypeptide is an antibody (or fragment thereof) that has specificity for a tumor cell, the antibody can be modified as described herein to include a modified amino acid, which can be subsequently conjugated to a cancer chemotherapeutic agent, such as a microtubule affecting agent. In certain embodiments, the drug is a microtubule affecting agent that has antiproliferative activity, such as a maytansinoid. In certain embodiments, the drug is a maytansinoid, which as the following structure:

where

indicates the point of attachment between the maytansinoid and the linker, L, in conjugates and compounds described herein. By “point of attachment” is meant that the

symbol indicates the bond between the N of the maytansinoid and the linker, L, in conjugates and compounds described herein. For example, in formula (I), W may be a maytansinoid, such as a maytansinoid of the structure above, where

indicates the point of attachment between the maytansinoid and the linker, L. In some cases, the maytansinoid of the structure above may be referred to as a deacyl maytansine.

Methods for Modification of Drugs to Contain a Reactive Partner

Drugs to be conjugated to a polypeptide may be modified to incorporate a reactive partner for reaction with the polypeptide. Where the drug is a peptide drug, the reactive moiety (e.g., aminooxy or hydrazide can be positioned at an N-terminal region, the N-terminus, a C-terminal region, the C-terminus, or at a position internal to the peptide. For example, an example of a method involves synthesizing a peptide drug having an aminooxy group. In this example, the peptide is synthesized from a Boc-protected precursor. An amino group of a peptide can react with a compound comprising a carboxylic acid group and oxy-N-Boc group. As an example, the amino group of the peptide reacts with 3-(2,5-dioxopyrrolidin-1-yloxy)propanoic acid. Other variations on the compound comprising a carboxylic acid group and oxy-N-protecting group can include different number of carbons in the alkylene linker and substituents on the alkylene linker. The reaction between the amino group of the peptide and the compound comprising a carboxylic acid group and oxy-N-protecting group occurs through standard peptide coupling chemistry. Examples of peptide coupling reagents that can be used include, but not limited to, DCC (dicyclohexylcarbodiimide), DIC (diisopropylcarbodiimide), di-p-toluoylcarbodiimide, BDP (1-benzotriazole diethylphosphate-1-cyclohexyl-3-(2-morpholinylethyl)carbodiimide), EDC (1-(3-dimethylaminopropyl-3-ethyl-carbodiimide hydrochloride), cyanuric fluoride, cyanuric chloride, TFFH (tetramethyl fluoroformamidinium hexafluorophosphosphate), DPPA (diphenylphosphorazidate), BOP (benzotriazol-1-yloxytris(dimethylamino)phosphonium hexafluorophosphate), HBTU (O-benzotriazol-1-yl-N,N,N′,N′-tetramethyluronium hexafluorophosphate), TBTU (O-benzotriazol-1-yl-N,N,N′,N′-tetramethyluronium tetrafluoroborate), TSTU (O-(N-succinimidyl)-N,N,N′,N′-tetramethyluronium tetrafluoroborate), HATU (N-[(dimethylamino)-1-H-1,2,3-triazolo[4,5,6]-pyridin-1-ylmethylene]-N-methylmethanaminium hexafluorophosphate N-oxide), BOP-Cl (bis(2-oxo-3-oxazolidinyl)phosphinic chloride), PyBOP ((1-H-1,2,3-benzotriazol-1-yloxy)-tris(pyrrolidino)phosphonium tetrafluorophopsphate), BrOP (bromotris(dimethylamino)phosphonium hexafluorophosphate), DEPBT (3-(diethoxyphosphoryloxy)-1,2,3-benzotriazin-4(3H)-one) PyBrOP (bromotris(pyrrolidino)phosphonium hexafluorophosphate). As a non-limiting example, HOBt and DIC can be used as peptide coupling reagents.

Deprotection to expose the amino-oxy functionality is performed on the peptide comprising an N-protecting group. Deprotection of the N-oxysuccinimide group, for example, occurs according to standard deprotection conditions for a cyclic amide group. Deprotecting conditions can be found in Greene and Wuts, Protective Groups in Organic Chemistry, 3rd Ed., 1999, John Wiley & Sons, NY and Harrison et al. Certain deprotection conditions include a hydrazine reagent, amino reagent, or sodium borohydride. Deprotection of a Boc protecting group can occur with TFA. Other reagents for deprotection include, but are not limited to, hydrazine, methylhydrazine, phenylhydrazine, sodium borohydride, and methylamine. The product and intermediates can be purified by conventional means, such as HPLC purification.

The ordinarily skilled artisan will appreciate that factors such as pH and steric hindrance (i.e., the accessibility of the amino acid residue to reaction with a reactive partner of interest) are of importance, Modifying reaction conditions to provide for optimal conjugation conditions is well within the skill of the ordinary artisan, and is routine in the art. Where conjugation is conducted with a polypeptide present in or on a living cell, the conditions are selected so as to be physiologically compatible. For example, the pH can be dropped temporarily for a time sufficient to allow for the reaction to occur but within a period tolerated by the cell (e.g., from about 30 min to 1 hour). Physiological conditions for conducting modification of polypeptides on a cell surface can be similar to those used in a ketone-azide reaction in modification of cells bearing cell-surface azides (see, e.g., U.S. Pat. No. 6,570,040).

Small molecule compounds containing, or modified to contain, an α-nucleophilic group that serves as a reactive partner with a compound or conjugate disclosed herein are also contemplated for use as drugs in the polypeptide-drug conjugates of the present disclosure. General methods are known in the art for chemical synthetic schemes and conditions useful for synthesizing a compound of interest (see, e.g., Smith and March, March's Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, Fifth Edition, Wiley-Interscience, 2001; or Vogel, A Textbook of Practical Organic Chemistry, Including Qualitative Organic Analysis, Fourth Edition, New York: Longman, 1978).

Peptide Drugs

In some cases, a conjugate comprises a covalently linked peptide. Suitable peptides include, but are not limited to, cytotoxic peptides; angiogenic peptides; anti-angiogenic peptides; peptides that activate B cells; peptides that activate T cells; anti-viral peptides; peptides that inhibit viral fusion; peptides that increase production of one or more lymphocyte populations; anti-microbial peptides; growth factors; growth hormone-releasing factors; vasoactive peptides; anti-inflammatory peptides; peptides that regulate glucose metabolism; an anti-thrombotic peptide; an anti-nociceptive peptide; a vasodilator peptide; a platelet aggregation inhibitor; an analgesic; and the like.

In some embodiments, the peptide can be chemically synthesized to include a group reactive with an amino acid residue or a modified amino acid residue of the polypeptide. A suitable synthetic peptide has a length of from 5 amino acids to 100 amino acids, or longer than 100 amino acids; e.g., a suitable peptide has a length of from 5 amino acids (aa) to 10 aa, from 10 aa to 15 aa, from 15 aa to 20 aa, from 20 aa to 25 aa, from 25 aa to 30 aa, from 30 aa to 40 aa, from 40 aa to 50 aa, from 50 aa to 60 aa, from 60 aa to 70 aa, from 70 aa to 80 aa, from 80 aa to 90 aa, or from 90 aa to 100 aa.

In certain embodiments, a peptide can be modified to contain an α-nucleophile-containing moiety (e.g., an aminooxy or hydrazide moiety), e.g., can be reacted with an fGly-containing polypeptide to yield a conjugate in which the polypeptide and peptide are linked by a hydrazone or oxime bond, respectively. Examples of methods of synthesizing a peptide, such that the synthetic peptide comprising a reactive group reactive with an amino acid residue or a modified amino acid residue of the polypeptide, are described above.

Suitable peptides include, but are not limited to, hLF-11 (an 11-amino acid N-terminal fragment of lactoferrin), an anti-microbial peptide; granulysin, an anti-microbial peptide; Plectasin (NZ2114; SAR 215500), an anti-microbial peptide; viral fusion inhibitors such as Fuzeon (enfuvirtide), TRI-1249 (T-1249; see, e.g., Matos et al. (2010) PLoS One 5:e9830), TRI-2635 (T-2635; see, e.g., Eggink et al. (2009) J. Biol. Chem. 284:26941), T651, and TRI-1144; C5a receptor inhibitors such as PMX-53, JPE-1375, and JSM-7717; POT-4, a human complement factor C3 inhibitor; Pancreate (an INGAP derivative sequence, a HIP-human proislet protein); somatostatin; a somatostatin analog such as DEBIO 8609 (Sanvar), octreotide, octreotide (C2L), octreotide QLT, octreotide LAR, Sandostatin LAR, SomaLAR, Somatuline (lanreotide), see, e.g., Deghenghi et al. (2001) Endocrine 14:29; TH9507 (Tesamorelin, a growth hormone-releasing factor); POL7080 (a protegrin analog, an anti-microbial peptide); relaxin; a corticotropin releasing factor agonist such as urotensin, sauvagine, and the like; a heat shock protein derivative such as DiaPep277; a human immunodeficiency virus entry inhibitor; a heat shock protein-20 mimic such as AZX100; a thrombin receptor activating peptide such as TP508 (Chrysalin); a urocortin 2 mimic (e.g., a CRF2 agonist) such as urocortin-2; an immune activator such as Zadaxin (thymalfasin; thymosin-α1), see, e.g., Sjogren (2004) J. Gastroenterol. Hepatol. 19:S69; a hepatitis C virus (HCV) entry inhibitorE2 peptide such as HCV3; an atrial natriuretic peptide such as HANP (Sun 4936; carperitide); an annexin peptide; a defensin (anti-microbial peptide) such as hBD2-4; a defensin (anti-microbial peptide) such as hBD-3; a defensin (anti-microbial peptide) such as PMX-30063; a histatin (anti-microbial peptide) such as histatin-3, histatin-5, histatin-6, and histatin-9; a histatin (anti-microbial peptide) such as PAC-113; an indolicidin (anti-microbial peptide) such as MX-594AN (Omniganin; CLS001); an indolicidin (anti-microbial peptide) such as Omnigard (MBI-226; CPI-226); an anti-microbial peptide such as an insect cecropin; an anti-microbial peptide such as a lactoferrin (talactoferrin); an LL-37/cathelicidin derivative (an anti-microbial peptide) such as P60.4 (OP-145); a magainin (an anti-microbial peptide) such as Pexiganan (MSI-78; Suponex); a protegrin (an anti-microbial peptide) such as IB-367 (Iseganan); an agan peptide; a beta-natriuretic peptide such as Natrecor, or Noratak (Nesiritide), or ularitide; bivalarudin (Angiomax), a thrombin inhibitor; a C peptide derivative; a calcitonin such as Miacalcin (Fortical); an enkephalin derivative; an erythropoiesis-stimulating peptide such as Hematide; a gap junction modulator such as Danegaptide (ZP1609); a gastrin-releasing peptide; a ghrelin; a glucagon-like peptide; a glucagon-like peptide-2 analog such as ZP1846 or ZP1848; a glucosaminyl muramyl dipeptide such as GMDP; a glycopeptide antibiotic such as Oritavancin; a teicoplanin derivative such as Dalbavancin; a gonadotropin releasing hormone (GnRH) such as Zoladex (Lupon) or Triptorelin; a histone deacetylase (HDAC) inhibitor depsipeptide such as PM02734 (Irvalec); an integrin such as eptifibatide; an insulin analog such as Humulog; a kahalalide depsipeptide such as PM02734; a kallikrein inhibitor such as Kalbitor (ecallantide); an antibiotic such as Telavancin; a lipopeptide such as Cubicin or MX-2401; a lutenizing hormone releasing hormone (LHRH) such as goserelin; an LHRH synthetic decapeptide agonist analog such as Treistar (triptorelin pamoate); an LHRH such as Eligard; an M2 protein channel peptide inhibitor; metreleptin; a melanocortin receptor agonist peptide such as bremalanotide/PT-141; a melanocortin; a muramyl tripeptide such as Mepact (mifamurtide); a myelin basic protein peptide such as MBP 8298 (dirucotide); an N-type voltage-gated calcium channel blocker such as Ziconotide (Prialt); a parathyroid hormone peptide; a parathyroid analog such as 768974; a peptide hormone analog such as UGP281; a prostaglandin F2-α receptor inhibitor such as PDC31; a protease inhibitor such as PPL-100; surfaxin; a thromobspondin-1 (TSP-1) mimetic such as CVX-045 or ABT 510; a vasoactive intestinal peptide; vasopressin; a Y2R agonist peptide such as RG7089; obinepeptide; and TM30339.

Detectable Labels

The conjugates, compounds and methods of the present disclosure can be used to conjugate a detectable label to polypeptide. Examples of detectable labels include, but are not limited to, fluorescent molecules (e.g., autofluorescent molecules, molecules that fluoresce upon contact with a reagent, etc.), radioactive labels (e.g., ¹¹¹In, ¹²⁵I, ¹³¹I, ²¹²B, ⁹⁰Y, ¹⁸⁶Rh, and the like), biotin (e.g., to be detected through reaction of biotin and avidin), fluorescent tags, imaging reagents, and the like. Detectable labels also include peptides or polypeptides that can be detected by antibody binding, e.g., by binding of a detectably labeled antibody or by detection of bound antibody through a sandwich-type assay. Further examples of detectable labels include, but are not limited to, dye labels (e.g., chromophores, fluorophores, such as, but not limited to, Alexa Fluor® fluorescent dyes (e.g., Alexa Fluor® 350, 405, 430, 488, 532, 546, 555, 568, 594, 595, 610, 633, 635, 647, 660, 680, 700, 750, 790, and the like), coumarins, rhodamines (5-carboxyrhodamine and sulfo derivates thereof, e.g., 5-carboxy-disulfo-rhodamine, carbopyranins and oxazines, such as ATTO dyes (e.g., ATTO 390, 425, 465, 488, 495, 520, 532, 550, 565, 590, 594, 610, 611X, 620, 633, 635, 637, 647, 647N, 655, 665, 680, 700, 725 or 740), biophysical probes (spin labels, nuclear magnetic resonance (NMR) probes), Förster Resonance Energy Transfer (FRET)-type labels (e.g., at least one member of a FRET pair, including at least one member of a fluorophore/quencher pair), Bioluminescence Resonance Energy Transfer (BRET)-type labels (e.g., at least one member of a BRET pair), immunodetectable tags (e.g., FLAG, His(6), and the like), localization tags (e.g., to identify association of a tagged polypeptide at the tissue or molecular cell level (e.g., association with a tissue type, or particular cell membrane), and the like.

Attachment of Moieties for Delivery to a Target Site

Embodiments of the present disclosure also include a polypeptide conjugated to one or more moieties, such as, but not limited to, a drug (e.g., a small molecule drug), toxin, or other molecule for delivery to a target site (e.g., a cell) and which can provide for a pharmacological activity or can serve as a target for delivery of other molecules.

Also contemplated are conjugates that include one of a pair of binding partners (e.g., a ligand, a ligand-binding portion of a receptor, a receptor-binding portion of a ligand, etc.). For example, the conjugate can include a polypeptide that serves as a viral receptor and, upon binding with a viral envelope protein or viral capsid protein, facilitates attachment of virus to the cell surface on which the modified polypeptide is expressed. Alternatively, the conjugate may include an antigen that is specifically bound by an antibody (e.g., monoclonal antibody), to facilitate detection and/or separation of host cells expressing the modified polypeptide.

Attachment of Target Molecules to a Support

The methods can provide for conjugation of a polypeptide to a moiety to facilitate attachment of the polypeptide to a solid substrate (e.g., to facilitate assays), or to a moiety to facilitate easy separation (e.g., a hapten recognized by an antibody bound to a magnetic bead). In some embodiments, the methods are used to provide for attachment of a protein to an array (e.g., chip) in a defined orientation. For example, a polypeptide modified at a selected site (e.g., at or near the N-terminus) can be generated, and the methods, conjugates and compounds used to deliver a moiety to the modified polypeptide. The moiety can then be used as the attachment site for affixing the polypeptide to a support (e.g., solid or semi-solid support, such as a support suitable for use as a microchip in high-throughput assays).

Water-Soluble Polymers

In some cases, a conjugate includes a covalently linked water-soluble polymer. A moiety of particular interest is a water-soluble polymer. A “water-soluble polymer” refers to a polymer that is soluble in water and is usually substantially non-immunogenic, and usually has an atomic molecular weight greater than 1,000 Daltons. The methods, conjugates and compounds described herein can be used to attach one or more water-soluble polymers to a polypeptide. Attachment of a water-soluble polymer (e.g., PEG) to a polypeptide, such as a pharmaceutically active (e.g., therapeutic) polypeptide can be desirable as such modification can increase the therapeutic index by increasing serum half-life as a result of increased proteolytic stability and/or decreased renal clearance. Additionally, attachment of one or more polymers (e.g., PEGylation) can reduce immunogenicity of protein pharmaceuticals.

In some embodiments, the water-soluble polymer has an effective hydrodynamic molecular weight of greater than 5,000 Da, greater than 10,000 Da, greater than 20,000 to 500,000 Da, greater than 40,000 Da to 300,000 Da, greater than 50,000 Da to 70,000 Da, such as greater than 60,000 Da. In some embodiments, the water-soluble polymer has an effective hydrodynamic molecular weight of from 10 kDa to 20 kDa, from 20 kDa to 25 kDa, from 25 kDa to 30 kDa, from 30 kDa to 50 kDa, or from 50 kDa to 100 kDa. By “effective hydrodynamic molecular weight” is intended the effective water-solvated size of a polymer chain as determined by aqueous-based size exclusion chromatography (SEC). When the water-soluble polymer contains polymer chains having polyalkylene oxide repeat units, such as ethylene oxide repeat units, each chain can have an atomic molecular weight of 200 Da to 80,000 Da, or 1,500 Da to 42,000 Da, including 2,000 to 20,000 Da. Unless referred to specifically, molecular weight is intended to refer to atomic molecular weight. Linear, branched, and terminally charged water soluble polymers (e.g., PEG) may be used.

Polymers useful as moieties to be attached to a polypeptide can have a wide range of molecular weights, and polymer subunits. These subunits may include a biological polymer, a synthetic polymer, or a combination thereof. Examples of such water-soluble polymers include: dextran and dextran derivatives, including dextran sulfate, P-amino cross linked dextrin, and carboxymethyl dextrin, cellulose and cellulose derivatives, including methylcellulose and carboxymethyl cellulose, starch and dextrines, and derivatives and hydroylactes of starch, polyalklyene glycol and derivatives thereof, including polyethylene glycol, methoxypolyethylene glycol, polyethylene glycol homopolymers, polypropylene glycol homopolymers, copolymers of ethylene glycol with propylene glycol, wherein said homopolymers and copolymers are unsubstituted or substituted at one end with an alkyl group, heparin and fragments of heparin, polyvinyl alcohol and polyvinyl ethyl ethers, polyvinylpyrrolidone, aspartamide, and polyoxyethylated polyols, with the dextran and dextran derivatives, dextrine and dextrine derivatives. It will be appreciated that various derivatives of the specifically recited water-soluble polymers are also contemplated.

Water-soluble polymers such as those described above include polyalkylene oxide based polymers, such as polyethylene glycol “PEG” (See. e.g., “Poly(ethylene glycol) Chemistry: Biotechnical and Biomedical Applications”, J. M. Harris, Ed., Plenum Press, New York, N.Y. (1992); and “Poly(ethylene glycol) Chemistry and Biological Applications”, J. M. Harris and S. Zalipsky, Eds., ACS (1997); and International Patent Applications: WO 90/13540, WO 92/00748, WO 92/16555, WO 94/04193,WO 94/14758, WO 94/17039, WO 94/18247, WO 94/28937, WO 95/11924, WO 96/00080, WO 96/23794, WO 98/07713, WO 98/41562, WO 98/48837, WO 99/30727, WO 99/32134, WO 99/33483, WO 99/53951, WO 01/26692, WO 95/13312, WO 96/21469, WO 97/03106, WO 99/45964, and U.S. Pat. Nos. 4,179,337; 5,075,046; 5,089,261; 5,100,992; 5,134,192; 5,166,309; 5,171,264; 5,213,891; 5,219,564; 5,275,838; 5,281,698; 5,298,643; 5,312,808; 5,321,095; 5,324,844; 5,349,001; 5,352,756; 5,405,877; 5,455,027; 5,446,090; 5,470,829; 5,478,805; 5,567,422; 5,605,976; 5,612,460; 5,614,549; 5,618,528; 5,672,662; 5,637,749; 5,643,575; 5,650,388; 5,681,567; 5,686,110; 5,730,990; 5,739,208; 5,756,593; 5,808,096; 5,824,778; 5,824,784; 5,840,900; 5,874,500; 5,880,131; 5,900,461; 5,902,588; 5,919,442; 5,919,455; 5,932,462; 5,965,119; 5,965,566; 5,985,263; 5,990,237; 6,011,042; 6,013,283; 6,077,939; 6,113,906; 6,127,355; 6,177,087; 6,180,095; 6,194,580; 6,214,966).

Examples of polymers of interest include those containing a polyalkylene oxide, polyamide alkylene oxide, or derivatives thereof, including polyalkylene oxide and polyamide alkylene oxide comprising an ethylene oxide repeat unit of the formula —(CH₂—CH₂—O)—. Further examples of polymers of interest include a polyamide having a molecular weight greater than 1,000 Daltons of the formula —[C(O)—X—C(O)—NH—Y—NH]n- or —[NH—Y—NH—C(O)—X—C(O)]_(n)—, where X and Y are divalent radicals that may be the same or different and may be branched or linear, and n is a discrete integer from 2-100, such as from 2 to 50, and where either or both of X and Y comprises a biocompatible, substantially non-antigenic water-soluble repeat unit that may be linear or branched. Further examples of water-soluble repeat units comprise an ethylene oxide of the formula —(CH₂—CH₂—O)— or —(O—CH₂—CH₂)—. The number of such water-soluble repeat units can vary significantly, with the number of such units being from 2 to 500, 2 to 400, 2 to 300, 2 to 200, 2 to 100, for example from 2 to 50. An example of an embodiment is one in which one or both of X and Y is selected from: —((CH₂)_(n1)—(CH₂—CH₂—O)_(n2)—(CH₂)— or —((CH₂)_(n1)—(O—CH₂—CH₂)_(n2)—(CH₂)_(n-1)—), where n1 is 1 to 6, 1 to 5, 1 to 4, or 1 to 3, and where n2 is 2 to 50, 2 to 25, 2 to 15, 2 to 10, 2 to 8, or 2 to 5. A further example of an embodiment is one in which X is —(CH₂—CH₂)—, and where Y is —(CH₂—(CH₂—CH₂—O)₃—CH₂—CH₂—CH₂)— or —(CH₂—CH₂—CH₂—(O—CH₂—CH₂)₃—CH₂)—.

The polymer can include one or more spacers or linkers. Examples of spacers or linkers include linear or branched moieties comprising one or more repeat units employed in a water-soluble polymer, diamino and or diacid units, natural or unnatural amino acids or derivatives thereof, as well as aliphatic moieties, including alkyl, aryl, heteroalkyl, heteroaryl, alkoxy, and the like, which can contain, for example, up to 18 carbon atoms or even an additional polymer chain.

The polymer moiety, or one or more of the spacers or linkers of the polymer moiety when present, may include polymer chains or units that are biostable or biodegradable. For example, polymers with repeat linkages have varying degrees of stability under physiological conditions depending on bond lability. Polymers with such bonds can be categorized by their relative rates of hydrolysis under physiological conditions based on known hydrolysis rates of low molecular weight analogs, e.g., from less stable to more stable, e.g., polyurethanes (—NH—C(O)—O—)>polyorthoesters (—O—C((OR)(R′))—O—)>polyamides (—C(O)—NH—). Similarly, the linkage systems attaching a water-soluble polymer to a target molecule may be biostable or biodegradable, e.g., from less stable to more stable: carbonate (—O—C(O)—O—)>ester (—C(O)—O—)>urethane (—NH—C(O)—O—)>orthoester (—O—C((OR)(R′))—O—)>amide (—C(O)—NH—). In general, it may be desirable to avoid use of a sulfated polysaccharide, depending on the lability of the sulfate group. In addition, it may be less desirable to use polycarbonates and polyesters. These bonds are provided by way of example, and are not intended to limit the types of bonds employable in the polymer chains or linkage systems of the water-soluble polymers useful in the modified aldehyde tagged polypeptides disclosed herein.

Formulations

The conjugates (including antibody conjugates) of the present disclosure can be formulated in a variety of different ways. In general, where the conjugate is a polypeptide-drug conjugate, the conjugate is formulated in a manner compatible with the drug conjugated to the polypeptide, the condition to be treated, and the route of administration to be used.

The conjugate (e.g., polypeptide-drug conjugate) can be provided in any suitable form, e.g., in the form of a pharmaceutically acceptable salt, and can be formulated for any suitable route of administration, e.g., oral, topical or parenteral administration. Where the conjugate is provided as a liquid injectable (such as in those embodiments where they are administered intravenously or directly into a tissue), the conjugate can be provided as a ready-to-use dosage form, or as a reconstitutable storage-stable powder or liquid composed of pharmaceutically acceptable carriers and excipients.

Methods for formulating conjugates can be adapted from those available in the art. For example, conjugates can be provided in a pharmaceutical composition comprising a therapeutically effective amount of a conjugate and a pharmaceutically acceptable carrier (e.g., saline). The pharmaceutical composition may optionally include other additives (e.g., buffers, stabilizers, preservatives, and the like). In some embodiments, the formulations are suitable for administration to a mammal, such as those that are suitable for administration to a human.

Methods of Treatment

The polypeptide-drug conjugates of the present disclosure find use in treatment of a condition or disease in a subject that is amenable to treatment by administration of the parent drug (i.e., the drug prior to conjugation to the polypeptide). By “treatment” is meant that at least an amelioration of the symptoms associated with the condition afflicting the host is achieved, where amelioration is used in a broad sense to refer to at least a reduction in the magnitude of a parameter, e.g. symptom, associated with the condition being treated. As such, treatment also includes situations where the pathological condition, or at least symptoms associated therewith, are completely inhibited, e.g., prevented from happening, or stopped, e.g. terminated, such that the host no longer suffers from the condition, or at least the symptoms that characterize the condition. Thus treatment includes: (i) prevention, that is, reducing the risk of development of clinical symptoms, including causing the clinical symptoms not to develop, e.g., preventing disease progression to a harmful state; (ii) inhibition, that is, arresting the development or further development of clinical symptoms, e.g., mitigating or completely inhibiting an active disease; and/or (iii) relief, that is, causing the regression of clinical symptoms.

In the context of cancer, the term “treating” includes any or all of: reducing growth of a solid tumor, inhibiting replication of cancer cells, reducing overall tumor burden, and ameliorating one or more symptoms associated with a cancer.

The subject to be treated can be one that is in need of therapy, where the host to be treated is one amenable to treatment using the parent drug. Accordingly, a variety of subjects may be amenable to treatment using the polypeptide-drug conjugates disclosed herein. Generally, such subjects are “mammals”, with humans being of interest. Other subjects can include domestic pets (e.g., dogs and cats), livestock (e.g., cows, pigs, goats, horses, and the like), rodents (e.g., mice, guinea pigs, and rats, e.g., as in animal models of disease), as well as non-human primates (e.g., chimpanzees, and monkeys).

The amount of polypeptide-drug conjugate administered can be initially determined based on guidance of a dose and/or dosage regimen of the parent drug. In general, the polypeptide-drug conjugates can provide for targeted delivery and/or enhanced serum half-life of the bound drug, thus providing for at least one of reduced dose or reduced administrations in a dosage regimen. Thus, the polypeptide-drug conjugates can provide for reduced dose and/or reduced administration in a dosage regimen relative to the parent drug prior to being conjugated in an polypeptide-drug conjugate of the present disclosure.

Furthermore, as noted above, because the polypeptide-drug conjugates can provide for controlled stoichiometry of drug delivery, dosages of polypeptide-drug conjugates can be calculated based on the number of drug molecules provided on a per polypeptide-drug conjugate basis.

In some embodiments, multiple doses of a polypeptide-drug conjugate are administered. The frequency of administration of a polypeptide-drug conjugate can vary depending on any of a variety of factors, e.g., severity of the symptoms, condition of the subject, etc. For example, in some embodiments, a polypeptide-drug conjugate is administered once per month, twice per month, three times per month, every other week (qow), once per week (qw), twice per week (biw), three times per week (tiw), four times per week, five times per week, six times per week, every other day (qod), daily (qd), twice a day (qid), or three times a day (tid).

Methods of Treating Cancer

The present disclosure provides methods for delivering a cancer chemotherapeutic agent to an individual having a cancer. The methods are useful for treating a wide variety of cancers, including carcinomas, sarcomas, leukemias, and lymphomas.

Carcinomas that can be treated using a subject method include, but are not limited to, esophageal carcinoma, hepatocellular carcinoma, basal cell carcinoma (a form of skin cancer), squamous cell carcinoma (various tissues), bladder carcinoma, including transitional cell carcinoma (a malignant neoplasm of the bladder), bronchogenic carcinoma, colon carcinoma, colorectal carcinoma, gastric carcinoma, lung carcinoma, including small cell carcinoma and non-small cell carcinoma of the lung, adrenocortical carcinoma, thyroid carcinoma, pancreatic carcinoma, breast carcinoma, ovarian carcinoma, prostate carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary adenocarcinoma, cystadenocarcinoma, medullary carcinoma, renal cell carcinoma, ductal carcinoma in situ or bile duct carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilm's tumor, cervical carcinoma, uterine carcinoma, testicular carcinoma, osteogenic carcinoma, epithelial carcinoma, and nasopharyngeal carcinoma, etc.

Sarcomas that can be treated using a subject method include, but are not limited to, fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, chordoma, osteogenic sarcoma, osteosarcoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's sarcoma, leiomyosarcoma, rhabdomyosarcoma, and other soft tissue sarcomas.

Other solid tumors that can be treated using a subject method include, but are not limited to, glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma, menangioma, melanoma, neuroblastoma, and retinoblastoma.

Leukemias that can be treated using a subject method include, but are not limited to, a) chronic myeloproliferative syndromes (neoplastic disorders of multipotential hematopoietic stem cells); b) acute myelogenous leukemias (neoplastic transformation of a multipotential hematopoietic stem cell or a hematopoietic cell of restricted lineage potential; c) chronic lymphocytic leukemias (CLL; clonal proliferation of immunologically immature and functionally incompetent small lymphocytes), including B-cell CLL, T-cell CLL prolymphocytic leukemia, and hairy cell leukemia; and d) acute lymphoblastic leukemias (characterized by accumulation of lymphoblasts). Lymphomas that can be treated using a subject method include, but are not limited to, B-cell lymphomas (e.g., Burkitt's lymphoma); Hodgkin's lymphoma; non-Hodgkin's B cell lymphoma; and the like.

EXAMPLES

The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the present invention, and are not intended to limit the scope of what the inventors regard as their invention nor are they intended to represent that the experiments below are all or the only experiments performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g. amounts, temperature, etc.) but some experimental errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is weight average molecular weight, temperature is in degrees Celsius, and pressure is at or near atmospheric. By “average” is meant the arithmetic mean. Standard abbreviations may be used, e.g., bp, base pair(s); kb, kilobase(s); pl, picoliter(s); s or sec, second(s); min, minute(s); h or hr, hour(s); aa, amino acid(s); kb, kilobase(s); bp, base pair(s); nt, nucleotide(s); i.m., intramuscular(ly); i.p., intraperitoneal(ly); s.c., subcutaneous(ly); and the like.

General Synthetic Procedures

Many general references providing commonly known chemical synthetic schemes and conditions useful for synthesizing the disclosed compounds are available (see, e.g., Smith and March, March's Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, Fifth Edition, Wiley-Interscience, 2001; or Vogel, A Textbook of Practical Organic Chemistry, Including Qualitative Organic Analysis, Fourth Edition, New York: Longman, 1978).

Compounds as described herein can be purified by any purification protocol known in the art, including chromatography, such as HPLC, preparative thin layer chromatography, flash column chromatography and ion exchange chromatography. Any suitable stationary phase can be used, including normal and reversed phases as well as ionic resins. In certain embodiments, the disclosed compounds are purified via silica gel and/or alumina chromatography. See, e.g., Introduction to Modern Liquid Chromatography, 2nd Edition, ed. L. R. Snyder and J. J. Kirkland, John Wiley and Sons, 1979; and Thin Layer Chromatography, ed E. Stahl, Springer-Verlag, New York, 1969.

During any of the processes for preparation of the subject compounds, it may be necessary and/or desirable to protect sensitive or reactive groups on any of the molecules concerned. This may be achieved by means of conventional protecting groups as described in standard works, such as J. F. W. McOmie, “Protective Groups in Organic Chemistry”, Plenum Press, London and New York 1973, in T. W. Greene and P. G. M. Wuts, “Protective Groups in Organic Synthesis”, Third edition, Wiley, New York 1999, in “The Peptides”; Volume 3 (editors: E. Gross and J. Meienhofer), Academic Press, London and New York 1981, in “Methoden der organischen Chemie”, Houben-Weyl, 4^(th) edition, Vol. 15/1, Georg Thieme Verlag, Stuttgart 1974, in H.-D. Jakubke and H. Jescheit, “Aminosauren, Peptide, Proteine”, Verlag Chemie, Weinheim, Deerfield Beach, and Basel 1982, and/or in Jochen Lehmann, “Chemie der Kohlenhydrate: Monosaccharide and Derivate”, Georg Thieme Verlag, Stuttgart 1974. The protecting groups may be removed at a convenient subsequent stage using methods known from the art.

The subject compounds can be synthesized via a variety of different synthetic routes using commercially available starting materials and/or starting materials prepared by conventional synthetic methods. A variety of examples of synthetic routes that can be used to synthesize the compounds disclosed herein are described in the schemes below.

Example 1

Experiments were performed to synthesize a maytansine derivative for attachment to a hydrazinyl-substituted heteroaryl coupling moiety according to embodiments of the present disclosure. Reactions were performed according to Scheme 1 shown below.

Synthesis of tert-Butyl (S)-1-(9H-fluoren-9-yl)-3,6-dioxo-5-(2-oxo-2-(tritylamino)ethyl)-2,10,13-trioxa-4,7-diazahexadecan-16-oate (Compound 12)

To a dried 20 mL glass scintillation vial containing a dried pea stir bar was added N-(9-fluorenylmethoxycarbonyl)-N-trityl-L-asparagine (11) (597.0 mg, 1.0 mmol), HATU (380.9 mg, 1.0 mmol), and anhydrous DMF (3 mL). The clear, colorless solution was stirred at room temperature for 15 min. 2-Methyl-2-propanyl 3-[2-(2-aminoethoxy)ethoxy]propanoate (10) (238.2 mg, 1.0 mmol), DIPEA (258.5 mg, 348.4 μL, 2.0 mmol), and anhydrous DMF (3 mL) were combined in a separate, dried 4 mL glass scintillation vial and added dropwise, slowly, to the stirring solution. The reaction was stirred at room temperature for 2 h, added to H₂O (100 mL) and 5 M NaCl (25 mL) in a separatory funnel, and extracted with 5×25 mL EtOAc. The organic fractions were combined, washed with H₂O (1×25 mL), 1.2 M NaHCO₃ (1×25 mL), and 5 M NaCl (1×25 mL), dried over Na₂SO₄, concentrated, adsorbed onto a Biotage KP C18 HS 12 g samplet, and purified on a Biotage KP C18 HS 60 g cartridge using a gradient of 0-100% CH₃CN in H₂O, giving compound 12 as a white, crystalline solid (708.4 mg, 87% yield).

MS (ESI) calcd for C₄₉H₅₄N₃O₈ [M+H]⁺: 812.4 found 812.4.

Synthesis of (S)-5-(2-Amino-2-oxoethyl)-1-(9H-fluoren-9-yl)-3,6-dioxo-2,10,13-trioxa-4,7-diazahexadecan-16-oic acid (Compound 13)

To a 20 mL glass scintillation vial containing a stir bar was added a solution of TFA (6.8 mL, 88.8 mmol) and triisopropylsilane (0.2 mL, 0.98 mmol) in 0.2 mL of H₂O. tert-Butyl (S)-1-(9H-fluoren-9-yl)-3,6-dioxo-5-(2-oxo-2-(tritylamino)ethyl)-2,10,13-trioxa-4,7-diazahexadecan-16-oate (12) (708.4 mg, 0.9 mmol) was added in small portions and the reaction was stirred at room temperature for 4 h. The solution was evaporated, adsorbed onto a Biotage KP C18 HS 12 g samplet, and purified on a Biotage KP C18 HS 60 g cartridge using a gradient of 0-100% CH₃CN in H₂O, giving 385.3 mg (86%) of compound 13 as a white solid.

¹H NMR (DMSO-d6, 400 MHz) δ 7.95-7.86 (m, 2H), 7.83 (t, J=5.6 Hz, 1H), 7.75-7.68 (m, 2H), 7.50 (d, J=8.4 Hz, 1H), 7.45-7.39 (m, 2H), 7.37-7.25 (m, 3H), 6.90 (s, 1H), 4.35-4.20 (m, 4H), 3.61-3.56 (m, 2H), 3.50-3.42 (m, 4H), 3.41-3.37 (m, 2H), 3.24-3.16 (m, 2H), 2.47-2.38 (m, 4H).

MS (ESI) calcd for C₂₆H₃₂N₃O₈ [M+H]⁺: 514.2 found 514.2.

Synthesis of (1⁴S,1⁶S,3²R,3³R,2R,4S,10E,12E,14R)-8⁶-Chloro-1⁴-hydroxy-8⁵,14-dimethoxy-3³,2,7,10-tetramethyl-1²,6-dioxo-7-aza-1(6,4)-oxazinana-3(2,3)-oxirana-8(1,3)-benzenacyclotetradecaphane-10,12-dien-4-yl (2S,15S)-15,17-diamino-2,3-dimethyl-4,14,17-trioxo-7,10-dioxa-3,13-diazaheptadecanoate (Compound 15)

To a dried 4 mL glass scintillation vial containing a dried stir bar was added (S)-5-(2-amino-2-oxoethyl)-1-(9H-fluoren-9-yl)-3,6-dioxo-2,10,13-trioxa-4,7-diazahexadecan-16-oic acid (13) (98.6 mg, 0.2 mmol), HATU (72.9 mg, 0.2 mmol), and anhydrous DMF (1 mL). The clear, colorless solution was stirred at room temperature for 15 min. Deacyl maytansine (14) (122.1 mg, 0.2 mmol), DIPEA (72.8 mg, 98.1 μL, 0.6 mmol), and anhydrous DMF (1 mL) were combined in a separate, dried 4 mL glass scintillation vial and added dropwise, slowly, to the stirring solution. The reaction was allowed to stir at room temperature for 2 h, adsorbed directly onto a Biotage KP C18 HS 3 g samplet, and purified on a Biotage KP C18 HS 30 g cartridge using a gradient of 0-100% CH₃CN in H₂O, giving compound 15 as a white solid (149.1 mg, 69% yield).

MS (ESI) calcd for C₄₃H₆₄ClN₆O₁₄ [M+H]⁺: 923.4 found 923.4.

Example 2

Experiments were performed to synthesize a maytansine modified to include a hydrazinyl-substituted heteroaryl coupling moiety (e.g., a hydrazinyl-substituted imidazolyl coupling moiety) according to embodiments of the present disclosure. Reactions were performed according to Scheme 2 shown below.

Synthesis of 1H-Imidazole-5-carbaldehyde (Compound 21)

A solution of 1.0 g (0.01 mol) of (1H-imidazol-5-yl)methanol (20) in 10.0 mL of MeOH was added 8.86 g (0.10 mol) of MnO₂. The mixture was stirred at room temperature overnight and filtered. The solvent was evaporated to afford 0.50 g (51%) of compound 21 as a white solid.

¹H NMR (MeOD, 400 MHz) δ 9.78 (s, 1H), 7.91 (s, 1H), 7.89 (s, 1H).

Synthesis of Methyl 3-(5-formyl-1H-imidazol-1-yl)propanoate (Compound 23)

To a solution of 0.25 g (0.003 mol) of 1H-imidazole-5-carbaldehyde (21) and 1.12 g (0.013 mol) of methylacrylate (22) in 5.0 mL of DMF was added 0.396 g (0.003 mol) of DBU. The reaction mixture was stirred at room temperature overnight and concentrated. The product was isolated by flash chromatography on silica gel to afford 0.15 g of compound 23 as an oil.

¹H NMR (CDCl₃, 400 MHz) δ 9.86 (s, 1H), 7.67 (s, 1H), 7.61 (s, 1H), 4.34 (t, J=6.4 Hz, 2H), 3.71 (s, 3H), 2.83 (t, J=6.4 Hz, 2H).

MS (ESI) calcd for C₈H₁₁N₂O₃ [M+H]⁺: 183.1 found 183.2.

Synthesis of 3-(5-Formyl-1H-imidazol-1-yl)propanoic acid (Compound 24)

A solution of 0.900 g (0.005 mol) of methyl 3-(5-formyl-1H-imidazol-1-yl)propanoate (23) and 0.622 g (0.015 mol) of LiOH in 20 mL of THF/H₂O (1:1) was stirred at room temperature for 2 h. The reaction mixture was concentrated and the product was isolated by preparative HPLC to afford 0.800 g (96%) of compound 24 as a colorless oil.

¹H NMR (DMSO-d₆, 400 MHz) δ 9.87 (s, 1H), 8.74 (s, 1H), 8.42 (s, 1H), 4.37 (t, J=6.6 Hz, 2H), 2.92 (t, J=6.6 Hz, 2H).

Synthesis of 3-(5-((2-(((9H-fluoren-9-yl)methoxy)carbonyl)-1,2-dimethylhydrazinyl)methyl)-1H-imidazol-1-yl)propanoic acid (Compound 26)

To a solution of 0.065 g (0.387 mmol) of 3-(5-formyl-1H-imidazol-1-yl)propanoic acid (24) in 2.0 mL of DCE was added 0.120 g (0.425 mmol) of (9H-fluoren-9-yl)methyl 1,2-dimethylhydrazine-1-carboxylate (25) (see: Eur. J. Med. Chem. 2014, 88, 3) and 50 mg of 4 Å molecular sieves. The mixture was stirred at room temperature for 30 min and 0.098 g of NaBH(OAc)₃ was added. The reaction mixture was stirred at room temperature for 24 h, then diluted with DCM (5 mL) and extracted with H₂O. The organic layer was washed with 0.1 M HCl, dried over Na₂SO₄, filtered and the solvent was removed in vacuo. The residue was purified on silica gel to afford 0.020 g (12%) of compound 26 as a white solid.

¹H NMR (CDCl₃, 400 MHz) δ 8.83 (bs, 1H), 7.76 (d, J=7.6 Hz, 2H), 7.55 (bs, 3H), 7.39 (dd (app. t), J=7.2 Hz, 2H), 7.32 (m, 3H), 4.9-4.0 (m, 7H), 3.20 (m, 1H), 2.85-2.50 (m, 7H), 2.04 (m, 1H).

MS (ESI) calcd for C₂₄H₂₇N₄O₄ [M+H]⁺: 435.2 found 435.4.

Synthesis of (1⁴S,1⁶S,3²R,3³R,2R,4S,10E,12E,14R)-8⁶-Chloro-1⁴-hydroxy-8⁵,14-dimethoxy-3³,2,7,10-tetramethyl-1²,6-dioxo-7-aza-1(6,4)-oxazinana-3(2,3)-oxirana-8(1,3)-benzenacyclotetradecaphane-10,12-dien-4-yl (2S,15S)-19-(4-((2-(((9H-fluoren-9-yl)methoxy)carbonyl)-1,2-dimethylhydrazinyl)methyl)-1H-imidazol-1-yl)-15-(2-amino-2-oxoethyl)-2,3-dimethyl-4,14,17-trioxo-7,10-dioxa-3,13,16-triazanonadecanoate (Compound 27)

To a solution of 21.7 mg (0.049 mmol) 3-(5-((2-(((9H-fluoren-9-yl)methoxy)carbonyl)-1,2-dimethylhydrazinyl)methyl)-1H-imidazol-1-yl)propanoic acid (26) in 300 μL of DMF was added 45.0 mg (0.049 mmol) of maytansinoid 15, 13 μl (0.098 mmol) of 2,4,6-trimethylpyridine, and 18.5 mg (0.049 mmol) of HATU. Monitoring by HPLC, the reaction was stirred until the starting materials were consumed. The crude reaction mixture was purified by reversed phase (C18) flash chromatography using a gradient of 0-100% MeCN:water with 0.1% formic acid as an additive to afford 50.2 mg of compound 27 in 77% yield.

MS (ESI) calcd for C₆₇H₈₈ClN₁₀O₁₇ [M+H]⁺: 1339.6; found 1339.5.

Synthesis of (1⁴S,1⁶S,3²R,3³R,2R,4S,10E,12E,14R)-8⁶-Chloro-1⁴-hydroxy-8⁵,14-dimethoxy-3³,2,7,10-tetramethyl-1²,6-dioxo-7-aza-1(6,4)-oxazinana-3(2,3)-oxirana-8(1,3)-benzenacyclotetradecaphane-10,12-dien-4-yl (2S,15S)-15-(2-amino-2-oxoethyl)-19-(4-((1,2-dimethylhydrazinyl)methyl)-1H-imidazol-1-yl)-2,3-dimethyl-4,14,17-trioxo-7,10-dioxa-3,13,16-triazanonadecanoate (Compound 28)

To a solution of 50.2 mg (0.038 mmol) of compound 27 in 300 μL of DMF was added 74 μL of piperidine. The reaction mixture was allowed to stand for 30 min and purified by reversed phase (C18) flash chromatography using a gradient of 0-100% MeCN:water. The product was collected in test tubes containing 3 mL of 0.1% formic acid in water. The solvent was removed in vacuo to afford 30.3 mg (71%) of compound 28 as the formate salt.

MS (ESI) calcd for C₅₂H₇₈ClN₁₀O₁₅ [M+H]⁺: 1117.5; found 1117.4.

Example 3

Experiments were performed to synthesize a maytansine modified to include a hydrazinyl-substituted heteroaryl coupling moiety (e.g., a hydrazinyl-substituted pyrrolyl coupling moiety) according to embodiments of the present disclosure. Reactions were performed according to Scheme 3 shown below.

Synthesis of methyl 3-(3-Formyl-1H-pyrrol-1-yl)propanoate (Compound 32)

To a solution of 1H-pyrrole-3-carbaldehyde (30) (500 mg, 5.26 mmol) in CH₃CN (10 mL) was added methyl acrylate (31) (3.8 mL, 42.08 mmol) and DBU (0.8 mL, 5.26 mmol). The resulting mixture was stirred at reflux overnight. The reaction mixture was cooled to room temperature, poured into ice-water and extracted with EtOAc. The combined organic layer was washed with brine, dried over Na₂SO₄, filtered and concentrated. The residue was purified by silica gel column chromatography (PE/EA=2:1) to give 800 mg (84%) of compound 32.

¹H NMR (CDCl₃, 400 MHz) δ 9.72 (s, 1H), 7.33 (s, 1H), 7.00 (s, 1H), 6.61 (s, 1H), 4.25 (t, J=6.6 Hz, 2H), 3.70 (s, 3H), 2.81 (t, J=6.6, Hz, 2H).

Synthesis of 3-(3-Formyl-1H-pyrrol-1-yl)propanoic acid (Compound 33)

To a solution of methyl 3-(3-formyl-1H-pyrrol-1-yl)propanoate (32) (800 mg, 4.42 mmol) in THF/H₂O=1:1 (10 mL) was added LiOH.H₂O (557 mg, 13.26 mmol). The resulting mixture was stirred at room temperature for 0.5 h. The pH was then adjusted to ˜3 with 2 N HCl and extracted with EtOAc. The combined organic layer was dried over Na₂SO₄, filtered and concentrated to give 650 mg (88%) of compound 33 as a yellow solid.

¹H NMR (DMSO-d₆, 400 MHz) δ 12.44 (s, 1H), 9.62 (s, 1H), 7.64 (s, 1H), 6.94 (s, 1H), 6.43 (s, 1H), 4.17 (t, J=6.8 Hz, 2H), 2.77 (t, J=6.8 Hz, 2H).

MS (ESI) calcd for C₈H₁₀NO₃ [M+H]⁺: 168.1; found 168.1.

Synthesis of 3-(3-((2-(((9H-Fluoren-9-yl)methoxy)carbonyl)-1,2-dimethylhydrazinyl)methyl)-1H-pyrrol-1-yl)propanoic acid (Compound 35)

To a solution of 3-(3-formyl-1H-pyrrol-1-yl)propanoic acid (33) (650 mg, 3.89 mmol) in THF (15 mL) was added (9H-fluoren-9-yl)methyl 1,2-dimethylhydrazine-1-carboxylate (34) (see: Eur. J. Med. Chem. 2014, 88, 3) (1.7 g, 6.03 mmol) and molecular sieves (1 g). The mixture was stirred at room temperature for 30 min. Then, NaBH(OAc)₃ (1.65 g, 7.78 mmol) was added in several portions at 0° C. The resulting mixture was stirred at room temperature overnight, then it was concentrated and crudely purified by silica gel column chromatography (DCM/MeOH=10:1) to give the crude product which was further purified by prep-HPLC to give 300 mg (18%) of compound 35.

¹H NMR (DMSO-d₆, 400 MHz) δ 7.90 (d, j=7.2 Hz, 2H), 7.80-7.50 (b, 2H), 7.41 (dd(app. t), J=7.4 Hz, 2H), 7.34 (dd(app. t), J=7.2 Hz, 2H), 6.58 (bs, 1H), 6.31 (bs, 1H), 5.87-5.70 (m, 1H), 4.60-4.40 (b, 3H), 4.28 (bs, 2H), 3.95 (t, J=6.8 Hz, 2H), 3.75 (b, 1H), 3.13 (bs, 1H), 2.66 (s, 3H), 2.49 (s, 3H).

MS (ESI) calcd for C₂₅H₂₈N₃O₄ [M+H]⁺: 434.2; found 434.0.

Synthesis of (1⁴S,1⁶S,3²R,3³R,2R,4S,10E,12E,14R)-8⁶-Chloro-1⁴-hydroxy-8⁵,14-dimethoxy-3³,2,7,10-tetramethyl-1²,6-dioxo-7-aza-1(6,4)-oxazinana-3(2,3)-oxirana-8(1,3)-benzenacyclotetradecaphane-10,12-dien-4-yl (2S,15S)-19-(3-((2-(((9H-fluoren-9-yl)methoxy)carbonyl)-1,2-dimethylhydrazinyl)methyl)-1H-pyrrol-1-yl)-15-(2-amino-2-oxoethyl)-2,3-dimethyl-4,14,17-trioxo-7,10-dioxa-3,13,16-triazanonadecanoate (Compound 36)

To a solution of 17.7 mg (0.019 mmol) of 3-(3-((2-(((9H-fluoren-9-yl)methoxy)carbonyl)-1,2-dimethylhydrazinyl)methyl)-1H-pyrrol-1-yl)propanoic acid (35) in 200 μL of DMF was added 17.7 mg (0.019 mmol) of maytansinoid 15, 5.1 μL (0.038 mmol) of 2,4,6-trimethylpyridine, and 7.3 mg (0.019 mmol) of HATU. Monitoring by HPLC, the reaction was stirred until the starting materials were consumed, approximately 1 h. The crude reaction mixture was purified by reversed phase (C18) flash chromatography using a gradient of 0-100% MeCN:water with 0.1% formic acid as an additive which afforded 25.7 mg (99%) of compound 36.

MS (ESI) calcd for C₆₈H₈₉ClN₉O₁₇ [M+H]⁺: 1338.6; found 1338.8.

Synthesis of (1⁴S,1⁶S,3²R,3³R,2R,4S,10E,12E,14R)-8⁶-Chloro-1⁴-hydroxy-8⁵,14-dimethoxy-3³,2,7,10-tetramethyl-1²,6-dioxo-7-aza-1(6,4)-oxazinana-3(2,3)-oxirana-8(1,3)-benzenacyclotetradecaphane-10,12-dien-4-yl (2S,15S)-15-(2-amino-2-oxoethyl)-19-(3-((1,2-dimethylhydrazinyl)methyl)-1H-pyrrol-1-yl)-2,3-dimethyl-4,14,17-trioxo-7,10-dioxa-3,13,16-triazanonadecanoate (Compound 37)

To a solution of 25.7 mg (0.019 mmol) of compound 36 in 150 μL of DMF was added 30 μL of piperidine and the reaction mixture was maintained for 30 min. The crude reaction was purified by reversed phase (C18) flash chromatography using a 0-100% MeCN:water gradient to afford 17.4 mg (82%) of the the title compound.

MS (ESI) calcd for C₅₃H₇₉ClN₉O₁₅ [M+H]⁺: 1116.5; found 1116.4.

Example 4

Experiments were performed to synthesize a maytansine modified to include a hydrazinyl-substituted heteroaryl coupling moiety (e.g., a hydrazinyl-substituted furanyl coupling moiety) according to embodiments of the present disclosure. Reactions were performed according to Scheme 4 shown below.

Synthesis of 4-(Hydroxymethyl)furan-3-carbaldehyde (Compound 41)

A solution of furan-3,4-diyldimethanol (40) (0.50 g, 3.9 mmol) in 10 mL of anhydrous DCM was placed in a 50 mL round-bottom flask equipped with a stirring bar. To this solution, a suspension of Dess-Martin periodinane (0.83 g, 1.96 mmol) in 5 mL of anhydrous DCM was added slowly over five minutes at room temperature. The resulting yellow solution was allowed to stir for 1 h at room temperature, then quenched with 10 mL of saturated sodium hydrogen sulfite and extracted with EtOAc (2×25 mL). The combined organics were washed with saturated NaHCO₃, followed by water and brine and dried over sodium sulfate. After removal of solvents under vacuum, the residue was purified on silica gel using 10-30% EtOAc in hexanes as eluent to afford 0.095 g (39%) of compound 41 as a white solid.

¹H NMR (400 MHz, CDCl₃) δ ppm 9.97 (s, 1H), 8.13 (s, 1H), 7.46 (s, 1H), 4.64 (s, 2H), 3.70 (br s, 1H).

MS (ESI) calcd for C₆H₇O₃ [M+H]⁺: 127.0 found 127.1.

Synthesis of (9H-Fluoren-9-yl)methyl 2-((4-(hydroxymethyl)furan-3-yl)methyl)-1,2-dimethylhydrazine-1-carboxylate (Compound 43)

To an oven-dried 3 mL scintillation vial equipped with a stirring bar was added 4-(hydroxymethyl)furan-3-carbaldehyde (41) (13 mg, 0.1 mmol), (9H-fluoren-9-yl)methyl 1,2-dimethylhydrazine-1-carboxylate (42) (28 mg, 0.1 mmol), followed by 2 mL of anhydrous DCE and 50 mg of 4 Å molecular sieves. The resulting mixture was vigorously stirred at room temperature for 15 minutes, and treated with NaBH(OAc)₃ (42 mg, 0.2 mmol) in one portion at room temperature. Stirring continued for 2 hours at room temperature until the reaction was judged complete by TLC analysis. The product was isolated by silica gel column chromatography (20-50% EtOAc/hexanes) to afford 34 mg (87%) compound 43 as a colorless oil.

MS (ESI) calcd for C₂₃H₂₅N₂O₄ [M+H]⁺: 393.2 found 393.2.

Synthesis of (9H-Fluoren-9-yl)methyl 1,2-dimethyl-2-((4-((((4-nitrophenoxy)carbonyl)oxy)methyl)furan-3-yl)methyl)hydrazine-1-carboxylate (Compound 45)

In an oven-dried 3 mL vial, a solution of (9H-fluoren-9-yl)methyl 2-((4-(hydroxymethyl)furan-3-yl)methyl)-1,2-dimethylhydrazine-1-carboxylate (43) (12 mg, 30 μmol) in 1 mL of anhydrous DMF was treated with DIPEA (11 μL, 60 mol), followed by bis(4-nitrophenyl)carbonate (44) (14 mg, 45 μmol) in one portion at room temperature. The resulting yellow solution was briefly vortexed and allowed to stand for 24 hours at room temperature. The reaction mixture was diluted with 1 mL of DCM and directly purified on silica gel, eluted with 10-20% EtOAc/hexanes to give 8.0 mg (48%) of compound 45 as a clear, colorless oil.

MS (ESI) calcd for C₃₀H₂₈N₃O₈ [M+H]⁺: 558.2 found 558.2.

Synthesis of (1⁴S,1⁶S,3²R,3³R,2R,4S,10E,12E,14R)-8⁶-chloro-1⁴-hydroxy-8⁵,14-dimethoxy-3³,2,7,10-tetramethyl-1²,6-dioxo-7-aza-1(6,4)-oxazinana-3(2,3)-oxirana-8(1,3)-benzenacyclotetradecaphane-10,12-dien-4-yl (5S,18S)-1-(4-((2-(((9H-fluoren-9-yl)methoxy)carbonyl)-1,2-dimethylhydrazinyl)methyl)furan-3-yl)-5-(2-amino-2-oxoethyl)-17,18-dimethyl-3,6,16-trioxo-2,10,13-trioxa-4,7,17-triazanonadecan-19-oate (Compound 46)

To a 3 mL vial was added maytansinoid 15 (13 mg, 14 mol), followed by 0.5 mL of anhydrous DMF and 5 μL (30 μmol) of DIPEA. To this mixture, a solution of (9H-fluoren-9-yl)methyl 1,2-dimethyl-2-((4-((((4-nitrophenoxy)carbonyl)oxy)methyl)furan-3-yl)methyl)hydrazine-1-carboxylate (45) (8 mg, 14 μmol) in 0.5 mL of anhydrous DMF was added in one portion at room temperature. The resulting solution was allowed to stand at room temperature for 36 hours, then purified by reversed phase chromatography (C18, 10-90% CH₃CN/water) to afford 8.5 mg (44%) of compound 46 as a colorless oil.

MS (ESI) calcd for C₆₇H₈₅ClN₈NaO₁₉ [M+Na]⁺: 1363.6 found 1363.5.

Synthesis of (1⁴S,1⁶S,3²R,3³R,2R,4S,10E,12E,14R)-8⁶-chloro-1⁴-hydroxy-8⁵,14-dimethoxy-3³,2,7,10-tetramethyl-1²,6-dioxo-7-aza-1(6,4)-oxazinana-3(2,3)-oxirana-8(1,3)-benzenacyclotetradecaphane-10,12-dien-4-yl (5S,18S)-5-(2-amino-2-oxoethyl)-1-(4-((1,2-dimethylhydrazinyl)methyl)furan-3-yl)-17,18-dimethyl-3,6,16-trioxo-2,10,13-trioxa-4,7,17-triazanonadecan-19-oate (Compound 47)

A solution of compound 46 (8 mg, 6 μmol) in 1 mL of anhydrous DMA was treated with 13 μL (0.12 mmol) of piperidine at room temperature. The reaction mixture was briefly vortexed and let stand at room temperature for 20 minutes until deprotection was judged complete by HPLC analysis. The product was isolated by reversed phase chromatography (C18, 0-50% CH₃CN/water gradient) to afford 4.0 mg (60%) of compound 47 as a colorless film.

MS (ESI) calcd for C₅₂H₇₆ClN₈O₁₇ [M+H]⁺: 1119.5 found 1119.5.

Example 5

General Procedure for Conjugation of D drug to an Aldehyde-Tagged Antibody

To conjugate a hydrazinyl-substituted heteroaryl-modified drug to an aldehyde-tagged antibody as described herein, the following general protocol was used. 1.0 mM hydrazinyl-substituted heteroaryl-modified drug (e.g., hydrazinyl-substituted imidazolyl coupling moiety-linker-Maytansine) was reacted with 13.5 μM aldehyde-tagged antibody (e.g., an aldehyde-tagged antibody (dimer) with one aldehyde tag per chain) in 5 mM sodium citrate, pH 5.5, 50 mM NaCl with 1% DMA at 37° C. for 16-24 hours. After the reaction was complete, unreacted drug was removed using 3 dilution and concentration steps in a 0.5 mL 10 MWCO Amicon spin filter. The sample was analyzed using hydrophobic interaction chromatography (HIC) to determine the drug to antibody ratio DAR (Tosoh #14947; Mobile Phase A: 25 mM NaPO4, 1.5 M NH4 SO4, pH 7.0; Mobile Phase B: 18.75 mM NaPO4, 25% IPA, pH 7.0).

FIG. 1 shows the HIC trace (min. vs. mAU) of the antibody-drug conjugates produced by conjugating the aldehyde-tagged antibody to the maytansine modified to include a hydrazinyl-substituted imidazolyl coupling moiety, as described in the example above. As shown in FIG. 1, the product included unconjugated (3.673 min.), mono-conjugated (4.833 min.), and di-conjugated (5.903 min.) antibody-drug conjugates.

While the present invention has been described with reference to the specific embodiments thereof, it should be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the true spirit and scope of the invention. In addition, many modifications may be made to adapt a particular situation, material, composition of matter, process, process step or steps, to the objective, spirit and scope of the present invention. All such modifications are intended to be within the scope of the claims appended hereto. 

1. A conjugate comprising at least one modified amino acid residue of formula (I):

wherein: R¹ and R² are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; one of R³ and R⁴ is a polypeptide and the other is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl; Z¹ is selected from CR⁷, N, O and S; Z² is C or N; R⁷ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; L is a linker; and W is a drug or a detectable label. 2.-16. (canceled)
 17. A compound of formula (II):

wherein: R⁸ is H and R⁹ is —(CR²R⁴)(NR⁶)(NHR⁵), or R⁸ is —(CR¹R³)(NR⁵)(NHR⁶) and R⁹ is H; R¹, R², R³ and R⁴ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; R⁵ and R⁶ are each independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or R⁵ and R⁶ are cyclically linked to form a 5 or 6-membered heterocyclyl; Z¹ is selected from CR⁷, N, O and S; Z² is C or N; R⁷ is selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; L is a linker; and W is a drug or a detectable label.
 18. The compound of claim 17, wherein Z¹ and Z² are each N.
 19. The compound of claim 17, wherein Z¹ is CR⁷ and Z² is N.
 20. The compound of claim 17, wherein Z¹ is O and Z² is C.
 21. The compound of claim 17, wherein R⁵ and R⁶ are each independently selected from alkyl and substituted alkyl.
 22. The compound of claim 17, wherein R⁷ is hydrogen.
 23. The compound of claim 17, wherein: the linker is of the formula -(T¹-V¹)_(a)-(T²-V²)b-(T³-V³)_(c)-(T⁴-V⁴)_(d)-(T⁵-V⁵)_(e)-, wherein a, b, c, d and e are each independently 0 or 1, where the sum of a, b, c, d and e is 1 to 5; T¹, T², T³, T⁴ and T⁵ are each independently selected from (C₁-C₁₂)alkyl, substituted (C₁-C₁₂)alkyl, (EDA)_(w), (PEG)_(n), (AA)_(p), —(CR¹³OH)_(h)—, 4-amino-piperidinyl (4AP), para-aminobenzyl (PAB), para-amino-benzyloxy (PABO), meta-amino-benzyloxy (MABO), para-amino-benzyloxycarbonyl (PABC), meta-amino-benzyloxycarbonyl (MABC), an acetal group, a disulfide, a hydrazine, a carbohydrate, a beta-lactam, an ester, (AA)_(p)-MABO, (AA)_(p)-MABC, (AA)_(p)-PABO, (AA)_(p)-PABC, MABO-(AA)_(p), MABC-(AA)_(p), PABO-(AA)_(p), PABC-(AA)_(p), (AA)_(p)-MABO-(AA)_(p), (AA)_(p)-MABC-(AA)_(p), (AA)_(p)-PABO-(AA)_(p), and (AA)_(p)-PABC-(AA)_(p); V¹, V², V³, V⁴ and V⁵ are each independently selected from the group consisting of a covalent bond, —CO—, —NR¹¹—, —CONR¹¹—, —NR¹¹CO—, —C(O)O—, —OC(O)—, —O—, —S—, —S(O)—, —SO₂—, —SO2NR¹¹—, —NR¹¹SO₂— and —P(O)OH—; each R¹¹ and R¹³ are independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; w is an integer from 1 to 20; n is an integer from 1 to 30; p is an integer from 1 to 20; and h is an integer from 1 to
 12. 24. The compound of claim 23, wherein: EDA is an ethylene diamine having the structure:

wherein q is an integer from 1 to 6, r is 0 or 1, and each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or wherein two adjacent R¹² groups are cyclically linked to form a piperazinyl ring; PEG is a polyethylene glycol or a substituted polyethylene glycol; AA is an amino acid residue; and 4AP is

wherein each R¹⁴ is independently selected from hydrogen, alkyl, substituted alkyl, polyethylene glycol, substituted polyethylene glycol, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.
 25. The compound of claim 24, wherein each R¹¹ is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.
 26. The compound of claim 24, wherein each R¹² is independently selected from hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl, or wherein any two adjacent R¹² groups are cyclically linked to form a piperazinyl ring.
 27. The compound of claim 24, wherein each R¹³ is independently selected from hydrogen, alkyl, substituted alkyl, aryl, and substituted aryl.
 28. The compound of claim 24, wherein each R¹⁴ is independently selected from hydrogen, alkyl, substituted alkyl, polyethylene glycol, substituted polyethylene glycol, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.
 29. The compound of claim 24, wherein T¹, T², T³,T⁴ and T⁵ and V¹, V², V³, V⁴ and V⁵ are selected from the following table: T¹ V¹ T² V² T³ V³ T⁴ V⁴ T⁵ V⁵ (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —NR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (EDA)_(w) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) — — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —SO₂— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (CR¹³OH)_(h) —CO— — — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— — — (C₁-C₁₂)alkyl —CONR¹¹— substituted —NR¹¹— (PEG)_(n) —CO— — — — — (C₁-C₁₂)alkyl (C₁-C₁₂)alkyl —SO2— (C₁-C₁₂)alkyl —CO— — — — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl — (CR¹³OH)_(h) —CONR¹¹— — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —P(O)OH— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (AA)_(p) — — — — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (C₁-C₁₂)alkyl —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— — — — — (C₁-C₁₂)alkyl —CONR¹¹— (C₁-C₁₂)alkyl —NR¹¹— — —CO— (C₁-C₁₂)alkyl —NR¹¹— — — (C₁-C₁₂)alkyl —CO— (EDA)_(w) —CO— (CR¹³OH)_(h) —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— — — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- —NR¹¹— (C₁-C₁₂)alkyl —CO— PABC (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— (AA)_(p)- — — — PABC- (AA)_(p) (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABO — — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABO — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABO — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — MABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC — (AA)_(p) — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABO — — — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABO — — — (C₁-C₁₂)alkyl —CO— (AA)_(p) —NR¹¹— (PEG)_(n) —CO— PABC — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— MABC — (AA)_(p) — — — (C₁-C₁₂)alkyl —CONR¹¹— (PEG)_(n) —CO— (AA)_(p) — PABC —NR¹¹— — — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC — (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — PABC- — (AA)_(p) (C₁-C₁₂)alkyl —CO— 4AP —CO— (C₁-C₁₂)alkyl —CO— (AA)_(p) — — —.


30. The compound of claim 24, wherein the linker is selected from one of the following structures:

wherein: each f is independently 0 or an integer from 1 to 12; each n is independently 0 or an integer from 1 to 30; each y is independently 0 or an integer from 1 to 20; each h is independently 0 or an integer from 1 to 12; each p is independently 0 or an integer from 1 to 20; each R is independently selected from hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl; and each R′ is independently selected from hydrogen, a sidechain group of an amino acid, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, amino, substituted amino, carboxyl, carboxyl ester, acyl, acyloxy, acyl amino, amino acyl, alkylamide, substituted alkylamide, sulfonyl, thioalkoxy, substituted thioalkoxy, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclyl, and substituted heterocyclyl.
 31. A method of producing a polypeptide conjugate, the method comprising: combining in a reaction mixture: a compound of claim 17, and a polypeptide comprising a reactive aldehyde group or a reactive ketone group, wherein the combining is under reaction conditions suitable to promote reaction between the compound and the reactive aldehyde group or reactive ketone group of the polypeptide to form a polypeptide conjugate; and isolating the polypeptide conjugate from the reaction mixture.
 32. A pharmaceutical composition comprising: a conjugate of claim 1; and a pharmaceutically acceptable excipient.
 33. A method of delivering a conjugate to a subject, the method comprising: administering to the subject an effective amount of a conjugate of claim
 1. 34. A method of treating a condition in a subject, the method comprising: administering to the subject having the condition a therapeutically effective amount of a pharmaceutical composition comprising a conjugate of claim 1, wherein the administering is effective to treat the condition in the subject. 